Overview
Brought to you by YData
Dataset statistics
| Number of variables | 93 |
|---|---|
| Number of observations | 338440 |
| Missing cells | 17578413 |
| Missing cells (%) | 55.8% |
| Total size in memory | 240.1 MiB |
| Average record size in memory | 744.0 B |
Variable types
| Text | 93 |
|---|
Dataset
| Description | NMNH Material Samples (USNM) 0049394-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.ycwxgd |
institutionID has constant value "http://grbio.org/cool/142r-0w94" | Constant |
datasetName has constant value "NMNH Material Samples (USNM)" | Constant |
basisOfRecord has constant value "MaterialSample" | Constant |
occurrenceStatus has constant value "present" | Constant |
organismScope has constant value "963.0" | Constant |
eventTime has constant value "94648" | Constant |
eventRemarks has constant value "Guide to Best Practices for Georeferencing. (Chapman and Wieczorek, eds. 2006). Google Earth Pro" | Constant |
geologicalContextID has constant value "(Keferstein)" | Constant |
earliestEraOrLowestErathem has constant value "Chordata" | Constant |
catalogNumber has 70749 (20.9%) missing values | Missing |
recordNumber has 181774 (53.7%) missing values | Missing |
recordedBy has 70194 (20.7%) missing values | Missing |
individualCount has 39392 (11.6%) missing values | Missing |
sex has 176515 (52.2%) missing values | Missing |
lifeStage has 205051 (60.6%) missing values | Missing |
preparations has 251349 (74.3%) missing values | Missing |
associatedMedia has 323875 (95.7%) missing values | Missing |
associatedSequences has 305730 (90.3%) missing values | Missing |
occurrenceRemarks has 193737 (57.2%) missing values | Missing |
organismID has 338436 (> 99.9%) missing values | Missing |
organismName has 338438 (> 99.9%) missing values | Missing |
organismScope has 338439 (> 99.9%) missing values | Missing |
materialSampleID has 85078 (25.1%) missing values | Missing |
eventType has 338436 (> 99.9%) missing values | Missing |
fieldNumber has 267431 (79.0%) missing values | Missing |
eventDate has 16369 (4.8%) missing values | Missing |
eventTime has 338439 (> 99.9%) missing values | Missing |
startDayOfYear has 18131 (5.4%) missing values | Missing |
endDayOfYear has 17911 (5.3%) missing values | Missing |
year has 16370 (4.8%) missing values | Missing |
month has 17966 (5.3%) missing values | Missing |
day has 19384 (5.7%) missing values | Missing |
verbatimEventDate has 236098 (69.8%) missing values | Missing |
habitat has 302334 (89.3%) missing values | Missing |
eventRemarks has 338439 (> 99.9%) missing values | Missing |
locationID has 284922 (84.2%) missing values | Missing |
higherGeography has 4534 (1.3%) missing values | Missing |
continent has 144951 (42.8%) missing values | Missing |
waterBody has 231595 (68.4%) missing values | Missing |
islandGroup has 315692 (93.3%) missing values | Missing |
island has 279541 (82.6%) missing values | Missing |
country has 14430 (4.3%) missing values | Missing |
stateProvince has 66214 (19.6%) missing values | Missing |
county has 140615 (41.5%) missing values | Missing |
locality has 34082 (10.1%) missing values | Missing |
minimumElevationInMeters has 249251 (73.6%) missing values | Missing |
maximumElevationInMeters has 284628 (84.1%) missing values | Missing |
verbatimElevation has 322501 (95.3%) missing values | Missing |
minimumDepthInMeters has 264207 (78.1%) missing values | Missing |
maximumDepthInMeters has 271190 (80.1%) missing values | Missing |
verbatimDepth has 336961 (99.6%) missing values | Missing |
locationRemarks has 338438 (> 99.9%) missing values | Missing |
decimalLatitude has 73885 (21.8%) missing values | Missing |
decimalLongitude has 73885 (21.8%) missing values | Missing |
geodeticDatum has 308301 (91.1%) missing values | Missing |
coordinateUncertaintyInMeters has 327413 (96.7%) missing values | Missing |
coordinatePrecision has 338436 (> 99.9%) missing values | Missing |
verbatimCoordinates has 338436 (> 99.9%) missing values | Missing |
verbatimLatitude has 230082 (68.0%) missing values | Missing |
verbatimLongitude has 230109 (68.0%) missing values | Missing |
verbatimCoordinateSystem has 329369 (97.3%) missing values | Missing |
verbatimSRS has 338436 (> 99.9%) missing values | Missing |
footprintSpatialFit has 338436 (> 99.9%) missing values | Missing |
georeferencedBy has 338436 (> 99.9%) missing values | Missing |
georeferenceProtocol has 255527 (75.5%) missing values | Missing |
georeferenceRemarks has 328933 (97.2%) missing values | Missing |
geologicalContextID has 338439 (> 99.9%) missing values | Missing |
earliestEonOrLowestEonothem has 338436 (> 99.9%) missing values | Missing |
latestEonOrHighestEonothem has 338436 (> 99.9%) missing values | Missing |
earliestEraOrLowestErathem has 338438 (> 99.9%) missing values | Missing |
latestEraOrHighestErathem has 338436 (> 99.9%) missing values | Missing |
earliestPeriodOrLowestSystem has 338436 (> 99.9%) missing values | Missing |
earliestEpochOrLowestSeries has 338436 (> 99.9%) missing values | Missing |
lowestBiostratigraphicZone has 338436 (> 99.9%) missing values | Missing |
formation has 338436 (> 99.9%) missing values | Missing |
identificationQualifier has 333367 (98.5%) missing values | Missing |
typeStatus has 331835 (98.0%) missing values | Missing |
identifiedBy has 226287 (66.9%) missing values | Missing |
scientificName has 24062 (7.1%) missing values | Missing |
higherClassification has 5901 (1.7%) missing values | Missing |
kingdom has 10613 (3.1%) missing values | Missing |
phylum has 36740 (10.9%) missing values | Missing |
class has 12521 (3.7%) missing values | Missing |
order has 30431 (9.0%) missing values | Missing |
family has 18609 (5.5%) missing values | Missing |
genus has 25827 (7.6%) missing values | Missing |
subgenus has 336132 (99.3%) missing values | Missing |
specificEpithet has 33273 (9.8%) missing values | Missing |
infraspecificEpithet has 326664 (96.5%) missing values | Missing |
taxonRank has 326679 (96.5%) missing values | Missing |
scientificNameAuthorship has 174042 (51.4%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-14 16:34:21.771505 |
|---|---|
| Analysis finished | 2025-01-14 16:34:33.689585 |
| Duration | 11.92 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 338440 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 338440 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 4501677301 |
|---|---|
| 2nd row | 3027962301 |
| 3rd row | 3028050301 |
| 4th row | 3027962302 |
| 5th row | 3028050302 |
| Value | Count | Frequency (%) |
| 4501677301 | 1 | < 0.1% |
| 3028050302 | 1 | < 0.1% |
| 3041539301 | 1 | < 0.1% |
| 3357130301 | 1 | < 0.1% |
| 3027962303 | 1 | < 0.1% |
| 3758404301 | 1 | < 0.1% |
| 3027962304 | 1 | < 0.1% |
| 3336913301 | 1 | < 0.1% |
| 3028050303 | 1 | < 0.1% |
| 4909491307 | 1 | < 0.1% |
| Other values (338430) | 338430 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 541548 | |
| 3 | 466717 | |
| 9 | 357521 | |
| 2 | 356979 | |
| 8 | 331351 | |
| 4 | 317675 | |
| 1 | 298477 | |
| 5 | 263706 | |
| 7 | 254468 | |
| 6 | 195958 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3384400 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 541548 | |
| 3 | 466717 | |
| 9 | 357521 | |
| 2 | 356979 | |
| 8 | 331351 | |
| 4 | 317675 | |
| 1 | 298477 | |
| 5 | 263706 | |
| 7 | 254468 | |
| 6 | 195958 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3384400 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 541548 | |
| 3 | 466717 | |
| 9 | 357521 | |
| 2 | 356979 | |
| 8 | 331351 | |
| 4 | 317675 | |
| 1 | 298477 | |
| 5 | 263706 | |
| 7 | 254468 | |
| 6 | 195958 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3384400 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 541548 | |
| 3 | 466717 | |
| 9 | 357521 | |
| 2 | 356979 | |
| 8 | 331351 | |
| 4 | 317675 | |
| 1 | 298477 | |
| 5 | 263706 | |
| 7 | 254468 | |
| 6 | 195958 | 5.8% |
modified
Text
| Distinct | 10795 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 2109 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | 2024-06-26 12:37:00 |
|---|---|
| 2nd row | 2021-10-14 09:12:00 |
| 3rd row | 2022-07-20 16:25:00 |
| 4th row | 2021-10-13 15:49:00 |
| 5th row | 2019-06-25 16:21:00 |
| Value | Count | Frequency (%) |
| 2021-05-07 | 24979 | 3.7% |
| 2024-09-05 | 21488 | 3.2% |
| 2022-10-06 | 14730 | 2.2% |
| 2021-10-14 | 13097 | 1.9% |
| 2021-10-13 | 12997 | 1.9% |
| 2024-01-01 | 11156 | 1.6% |
| 2024-10-17 | 10237 | 1.5% |
| 2017-12-07 | 9787 | 1.4% |
| 2023-12-17 | 9667 | 1.4% |
| 2022-07-20 | 9524 | 1.4% |
| Other values (1817) | 539218 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1618219 | |
| 2 | 1049110 | |
| 1 | 803470 | |
| - | 676880 | |
| : | 676880 | |
| 338440 | 5.3% | |
| 3 | 230325 | 3.6% |
| 4 | 213783 | 3.3% |
| 5 | 202489 | 3.1% |
| 7 | 196745 | 3.1% |
| Other values (3) | 424019 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4738160 | |
| Dash Punctuation | 676880 | 10.5% |
| Other Punctuation | 676880 | 10.5% |
| Space Separator | 338440 | 5.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1618219 | |
| 2 | 1049110 | |
| 1 | 803470 | |
| 3 | 230325 | 4.9% |
| 4 | 213783 | 4.5% |
| 5 | 202489 | 4.3% |
| 7 | 196745 | 4.2% |
| 6 | 170871 | 3.6% |
| 9 | 151998 | 3.2% |
| 8 | 101150 | 2.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 676880 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676880 |
Space Separator
| Value | Count | Frequency (%) |
| 338440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6430360 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1618219 | |
| 2 | 1049110 | |
| 1 | 803470 | |
| - | 676880 | |
| : | 676880 | |
| 338440 | 5.3% | |
| 3 | 230325 | 3.6% |
| 4 | 213783 | 3.3% |
| 5 | 202489 | 3.1% |
| 7 | 196745 | 3.1% |
| Other values (3) | 424019 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6430360 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1618219 | |
| 2 | 1049110 | |
| 1 | 803470 | |
| - | 676880 | |
| : | 676880 | |
| 338440 | 5.3% | |
| 3 | 230325 | 3.6% |
| 4 | 213783 | 3.3% |
| 5 | 202489 | 3.1% |
| 7 | 196745 | 3.1% |
| Other values (3) | 424019 | 6.6% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 31 |
| Mean length | 31 |
| Min length | 31 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | http://grbio.org/cool/142r-0w94 |
|---|---|
| 2nd row | http://grbio.org/cool/142r-0w94 |
| 3rd row | http://grbio.org/cool/142r-0w94 |
| 4th row | http://grbio.org/cool/142r-0w94 |
| 5th row | http://grbio.org/cool/142r-0w94 |
| Value | Count | Frequency (%) |
| http://grbio.org/cool/142r-0w94 | 338440 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 1353760 | 12.9% |
| o | 1353760 | 12.9% |
| r | 1015320 | 9.7% |
| g | 676880 | 6.5% |
| t | 676880 | 6.5% |
| 4 | 676880 | 6.5% |
| h | 338440 | 3.2% |
| 1 | 338440 | 3.2% |
| w | 338440 | 3.2% |
| 0 | 338440 | 3.2% |
| Other values (10) | 3384400 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6091920 | |
| Other Punctuation | 2030640 | 19.4% |
| Decimal Number | 2030640 | 19.4% |
| Dash Punctuation | 338440 | 3.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1353760 | |
| r | 1015320 | |
| g | 676880 | |
| t | 676880 | |
| h | 338440 | 5.6% |
| w | 338440 | 5.6% |
| l | 338440 | 5.6% |
| c | 338440 | 5.6% |
| i | 338440 | 5.6% |
| b | 338440 | 5.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 676880 | |
| 1 | 338440 | |
| 0 | 338440 | |
| 2 | 338440 | |
| 9 | 338440 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1353760 | |
| . | 338440 | 16.7% |
| : | 338440 | 16.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 338440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6091920 | |
| Common | 4399720 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1353760 | |
| r | 1015320 | |
| g | 676880 | |
| t | 676880 | |
| h | 338440 | 5.6% |
| w | 338440 | 5.6% |
| l | 338440 | 5.6% |
| c | 338440 | 5.6% |
| i | 338440 | 5.6% |
| b | 338440 | 5.6% |
Common
| Value | Count | Frequency (%) |
| / | 1353760 | |
| 4 | 676880 | |
| 1 | 338440 | 7.7% |
| 0 | 338440 | 7.7% |
| - | 338440 | 7.7% |
| 2 | 338440 | 7.7% |
| . | 338440 | 7.7% |
| : | 338440 | 7.7% |
| 9 | 338440 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10491640 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 1353760 | 12.9% |
| o | 1353760 | 12.9% |
| r | 1015320 | 9.7% |
| g | 676880 | 6.5% |
| t | 676880 | 6.5% |
| 4 | 676880 | 6.5% |
| h | 338440 | 3.2% |
| 1 | 338440 | 3.2% |
| w | 338440 | 3.2% |
| 0 | 338440 | 3.2% |
| Other values (10) | 3384400 |
collectionID
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad |
|---|---|
| 2nd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 3rd row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 4th row | urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 |
| 5th row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| Value | Count | Frequency (%) |
| urn:uuid:18e3cd08-a962-4f0a-b72c-9a0b3600c5ad | 119162 | |
| urn:uuid:f14c21a9-8cbf-4c8b-817f-d19d427e2dd6 | 74430 | |
| urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 | 42294 | 12.5% |
| urn:uuid:09c9cf5f-f5d3-48cc-b5c8-cd9b9fbd631f | 41606 | 12.3% |
| urn:uuid:cc104cbf-fd8e-4801-9b71-36731a7db1a0 | 28278 | 8.4% |
| urn:uuid:59e56a59-8615-4e0c-841d-eb88f3876b22 | 24507 | 7.2% |
| urn:uuid:73d83e23-1999-42cd-b38a-c06a7d32d893 | 8163 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1353760 | 8.9% |
| d | 1155311 | 7.6% |
| c | 1040667 | 6.8% |
| u | 1015320 | 6.7% |
| 8 | 917582 | 6.0% |
| 0 | 797214 | 5.2% |
| a | 775349 | 5.1% |
| 1 | 741643 | 4.9% |
| 9 | 705846 | 4.6% |
| : | 676880 | 4.4% |
| Other values (12) | 6050228 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6802549 | |
| Decimal Number | 6396611 | |
| Dash Punctuation | 1353760 | 8.9% |
| Other Punctuation | 676880 | 4.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1155311 | |
| c | 1040667 | |
| u | 1015320 | |
| a | 775349 | |
| f | 673839 | |
| b | 654013 | |
| e | 472730 | |
| i | 338440 | 5.0% |
| r | 338440 | 5.0% |
| n | 338440 | 5.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 917582 | |
| 0 | 797214 | |
| 1 | 741643 | |
| 9 | 705846 | |
| 3 | 620753 | |
| 2 | 619705 | |
| 6 | 591204 | |
| 4 | 582379 | |
| 7 | 478277 | |
| 5 | 342008 | 5.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1353760 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 676880 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8427251 | |
| Latin | 6802549 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1353760 | |
| 8 | 917582 | |
| 0 | 797214 | |
| 1 | 741643 | |
| 9 | 705846 | |
| : | 676880 | |
| 3 | 620753 | |
| 2 | 619705 | |
| 6 | 591204 | |
| 4 | 582379 | |
| Other values (2) | 820285 |
Latin
| Value | Count | Frequency (%) |
| d | 1155311 | |
| c | 1040667 | |
| u | 1015320 | |
| a | 775349 | |
| f | 673839 | |
| b | 654013 | |
| e | 472730 | |
| i | 338440 | 5.0% |
| r | 338440 | 5.0% |
| n | 338440 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15229800 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1353760 | 8.9% |
| d | 1155311 | 7.6% |
| c | 1040667 | 6.8% |
| u | 1015320 | 6.7% |
| 8 | 917582 | 6.0% |
| 0 | 797214 | 5.2% |
| a | 775349 | 5.1% |
| 1 | 741643 | 4.9% |
| 9 | 705846 | 4.6% |
| : | 676880 | 4.4% |
| Other values (12) | 6050228 |
institutionCode
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.750065004 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | US |
| Value | Count | Frequency (%) |
| usnm | 296146 | |
| us | 42294 | 12.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 338440 | |
| S | 338440 | |
| N | 296146 | |
| M | 296146 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1269172 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 338440 | |
| S | 338440 | |
| N | 296146 | |
| M | 296146 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1269172 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 338440 | |
| S | 338440 | |
| N | 296146 | |
| M | 296146 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1269172 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 338440 | |
| S | 338440 | |
| N | 296146 | |
| M | 296146 |
collectionCode
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 2.982250916 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ENT |
|---|---|
| 2nd row | IZ |
| 3rd row | IZ |
| 4th row | IZ |
| 5th row | US |
| Value | Count | Frequency (%) |
| ent | 119162 | |
| iz | 74430 | |
| us | 42294 | 12.5% |
| fish | 41606 | 12.3% |
| herp | 28278 | 8.4% |
| mamm | 24507 | 7.2% |
| birds | 8163 | 2.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 147440 | |
| I | 124199 | |
| N | 119162 | |
| T | 119162 | |
| S | 92063 | |
| Z | 74430 | |
| M | 73521 | |
| H | 69884 | |
| U | 42294 | 4.2% |
| F | 41606 | 4.1% |
| Other values (5) | 105552 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1009313 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 147440 | |
| I | 124199 | |
| N | 119162 | |
| T | 119162 | |
| S | 92063 | |
| Z | 74430 | |
| M | 73521 | |
| H | 69884 | |
| U | 42294 | 4.2% |
| F | 41606 | 4.1% |
| Other values (5) | 105552 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1009313 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 147440 | |
| I | 124199 | |
| N | 119162 | |
| T | 119162 | |
| S | 92063 | |
| Z | 74430 | |
| M | 73521 | |
| H | 69884 | |
| U | 42294 | 4.2% |
| F | 41606 | 4.1% |
| Other values (5) | 105552 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1009313 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 147440 | |
| I | 124199 | |
| N | 119162 | |
| T | 119162 | |
| S | 92063 | |
| Z | 74430 | |
| M | 73521 | |
| H | 69884 | |
| U | 42294 | 4.2% |
| F | 41606 | 4.1% |
| Other values (5) | 105552 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 28 |
| Mean length | 28 |
| Min length | 28 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Material Samples (USNM) |
|---|---|
| 2nd row | NMNH Material Samples (USNM) |
| 3rd row | NMNH Material Samples (USNM) |
| 4th row | NMNH Material Samples (USNM) |
| 5th row | NMNH Material Samples (USNM) |
| Value | Count | Frequency (%) |
| nmnh | 338440 | |
| material | 338440 | |
| samples | 338440 | |
| usnm | 338440 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1015320 | |
| 1015320 | ||
| a | 1015320 | |
| M | 1015320 | |
| e | 676880 | 7.1% |
| l | 676880 | 7.1% |
| S | 676880 | 7.1% |
| p | 338440 | 3.6% |
| U | 338440 | 3.6% |
| ( | 338440 | 3.6% |
| Other values (7) | 2369080 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4399720 | |
| Uppercase Letter | 3384400 | |
| Space Separator | 1015320 | 10.7% |
| Open Punctuation | 338440 | 3.6% |
| Close Punctuation | 338440 | 3.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1015320 | |
| e | 676880 | |
| l | 676880 | |
| p | 338440 | 7.7% |
| s | 338440 | 7.7% |
| i | 338440 | 7.7% |
| m | 338440 | 7.7% |
| r | 338440 | 7.7% |
| t | 338440 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1015320 | |
| M | 1015320 | |
| S | 676880 | |
| U | 338440 | 10.0% |
| H | 338440 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1015320 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 338440 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 338440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7784120 | |
| Common | 1692200 | 17.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1015320 | |
| a | 1015320 | |
| M | 1015320 | |
| e | 676880 | |
| l | 676880 | |
| S | 676880 | |
| p | 338440 | 4.3% |
| U | 338440 | 4.3% |
| s | 338440 | 4.3% |
| i | 338440 | 4.3% |
| Other values (4) | 1353760 |
Common
| Value | Count | Frequency (%) |
| 1015320 | ||
| ( | 338440 | 20.0% |
| ) | 338440 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9476320 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1015320 | |
| 1015320 | ||
| a | 1015320 | |
| M | 1015320 | |
| e | 676880 | 7.1% |
| l | 676880 | 7.1% |
| S | 676880 | 7.1% |
| p | 338440 | 3.6% |
| U | 338440 | 3.6% |
| ( | 338440 | 3.6% |
| Other values (7) | 2369080 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | MaterialSample |
|---|---|
| 2nd row | MaterialSample |
| 3rd row | MaterialSample |
| 4th row | MaterialSample |
| 5th row | MaterialSample |
| Value | Count | Frequency (%) |
| materialsample | 338440 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1015320 | |
| e | 676880 | |
| l | 676880 | |
| M | 338440 | 7.1% |
| t | 338440 | 7.1% |
| r | 338440 | 7.1% |
| i | 338440 | 7.1% |
| S | 338440 | 7.1% |
| m | 338440 | 7.1% |
| p | 338440 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4061280 | |
| Uppercase Letter | 676880 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1015320 | |
| e | 676880 | |
| l | 676880 | |
| t | 338440 | 8.3% |
| r | 338440 | 8.3% |
| i | 338440 | 8.3% |
| m | 338440 | 8.3% |
| p | 338440 | 8.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 338440 | |
| S | 338440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4738160 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1015320 | |
| e | 676880 | |
| l | 676880 | |
| M | 338440 | 7.1% |
| t | 338440 | 7.1% |
| r | 338440 | 7.1% |
| i | 338440 | 7.1% |
| S | 338440 | 7.1% |
| m | 338440 | 7.1% |
| p | 338440 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4738160 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1015320 | |
| e | 676880 | |
| l | 676880 | |
| M | 338440 | 7.1% |
| t | 338440 | 7.1% |
| r | 338440 | 7.1% |
| i | 338440 | 7.1% |
| S | 338440 | 7.1% |
| m | 338440 | 7.1% |
| p | 338440 | 7.1% |
occurrenceID
Text
Unique 
| Distinct | 338440 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 338440 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/300028c5f-ea1d-4c01-9253-09524fc57db6 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/30006cd83-36b3-4629-86db-f5a28307189f |
| 3rd row | http://n2t.net/ark:/65665/30007a443-7a0a-49a9-9c54-cae1342160a6 |
| 4th row | http://n2t.net/ark:/65665/300098b69-426b-451c-a675-27a1b7bb5b60 |
| 5th row | http://n2t.net/ark:/65665/3000a9424-501b-43e7-a337-ee632a8fa9d0 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/300028c5f-ea1d-4c01-9253-09524fc57db6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000a9424-501b-43e7-a337-ee632a8fa9d0 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000ff086-55d6-4f50-81a9-fc07e565e180 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300114e18-4d31-4558-acc1-47ce8dd8940c | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300119514-9afd-4342-83ae-3526ac40f20f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300154f73-1f7a-4d73-8c43-7c6d66c03b0f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30015c5b5-263e-4d28-916f-89728207dfda | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3001878d3-3d26-4b66-9ad5-77d6938de137 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300187c30-1f5e-4401-a208-4e42206dc341 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300193d42-6a2a-41b9-b203-29e571953cd6 | 1 | < 0.1% |
| Other values (338430) | 338430 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 1692200 | 7.9% |
| 6 | 1649636 | 7.7% |
| - | 1353760 | 6.3% |
| t | 1353760 | 6.3% |
| 5 | 1311144 | 6.1% |
| a | 1057250 | 5.0% |
| 4 | 973748 | 4.6% |
| 3 | 973420 | 4.6% |
| 2 | 972810 | 4.6% |
| e | 971609 | 4.6% |
| Other values (16) | 9012383 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9223035 | |
| Lowercase Letter | 8037405 | |
| Other Punctuation | 2707520 | 12.7% |
| Dash Punctuation | 1353760 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1353760 | |
| a | 1057250 | |
| e | 971609 | |
| b | 718766 | |
| n | 676880 | |
| f | 635572 | |
| c | 635398 | |
| d | 634410 | |
| k | 338440 | 4.2% |
| r | 338440 | 4.2% |
| Other values (2) | 676880 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1649636 | |
| 5 | 1311144 | |
| 4 | 973748 | |
| 3 | 973420 | |
| 2 | 972810 | |
| 9 | 719712 | |
| 8 | 718307 | |
| 0 | 635068 | 6.9% |
| 1 | 634600 | 6.9% |
| 7 | 634590 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1692200 | |
| : | 676880 | 25.0% |
| . | 338440 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1353760 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13284315 | |
| Latin | 8037405 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 1692200 | |
| 6 | 1649636 | |
| - | 1353760 | |
| 5 | 1311144 | |
| 4 | 973748 | |
| 3 | 973420 | |
| 2 | 972810 | |
| 9 | 719712 | 5.4% |
| 8 | 718307 | 5.4% |
| : | 676880 | 5.1% |
| Other values (4) | 2242698 |
Latin
| Value | Count | Frequency (%) |
| t | 1353760 | |
| a | 1057250 | |
| e | 971609 | |
| b | 718766 | |
| n | 676880 | |
| f | 635572 | |
| c | 635398 | |
| d | 634410 | |
| k | 338440 | 4.2% |
| r | 338440 | 4.2% |
| Other values (2) | 676880 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21321720 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 1692200 | 7.9% |
| 6 | 1649636 | 7.7% |
| - | 1353760 | 6.3% |
| t | 1353760 | 6.3% |
| 5 | 1311144 | 6.1% |
| a | 1057250 | 5.0% |
| 4 | 973748 | 4.6% |
| 3 | 973420 | 4.6% |
| 2 | 972810 | 4.6% |
| e | 971609 | 4.6% |
| Other values (16) | 9012383 |
catalogNumber
Text
Missing 
| Distinct | 226029 |
|---|---|
| Distinct (%) | 84.4% |
| Missing | 70749 |
| Missing (%) | 20.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 20 |
| Mean length | 14.08593117 |
| Min length | 9 |
Unique
| Unique | 192923 ? |
|---|---|
| Unique (%) | 72.1% |
Sample
| 1st row | USNMENT00976719.2 |
|---|---|
| 2nd row | USNM 1566725 |
| 3rd row | USNM 1430312 |
| 4th row | USNM 1477111 |
| 5th row | USNMENT01646520 |
| Value | Count | Frequency (%) |
| usnm | 146337 | |
| herp | 7481 | 1.7% |
| tissue | 7190 | 1.6% |
| us | 2194 | 0.5% |
| wet | 2190 | 0.5% |
| lot | 2190 | 0.5% |
| 2190 | 0.5% | |
| image | 291 | 0.1% |
| 594492 | 64 | < 0.1% |
| 1487948 | 58 | < 0.1% |
| Other values (223627) | 267569 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 384657 | 10.2% |
| 1 | 339055 | 9.0% |
| 0 | 282387 | 7.5% |
| S | 267692 | 7.1% |
| U | 267691 | 7.1% |
| M | 265497 | 7.0% |
| 4 | 250940 | 6.7% |
| 6 | 201528 | 5.3% |
| 3 | 187578 | 5.0% |
| 2 | 175087 | 4.6% |
| Other values (26) | 1148565 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1991709 | |
| Uppercase Letter | 1438835 | |
| Space Separator | 170063 | 4.5% |
| Other Punctuation | 95183 | 2.5% |
| Lowercase Letter | 72697 | 1.9% |
| Dash Punctuation | 2190 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 17152 | |
| s | 14380 | |
| p | 7481 | |
| r | 7481 | |
| i | 7190 | |
| u | 7190 | |
| t | 4380 | 6.0% |
| w | 2190 | 3.0% |
| l | 2190 | 3.0% |
| o | 2190 | 3.0% |
| Other values (3) | 873 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 384657 | |
| S | 267692 | |
| U | 267691 | |
| M | 265497 | |
| T | 126351 | 8.8% |
| E | 119160 | 8.3% |
| H | 7481 | 0.5% |
| I | 291 | < 0.1% |
| A | 14 | < 0.1% |
| R | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 339055 | |
| 0 | 282387 | |
| 4 | 250940 | |
| 6 | 201528 | |
| 3 | 187578 | |
| 2 | 175087 | |
| 5 | 167513 | |
| 9 | 130924 | 6.6% |
| 7 | 128542 | 6.5% |
| 8 | 128155 | 6.4% |
Space Separator
| Value | Count | Frequency (%) |
| 170063 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 95183 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2190 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2259145 | |
| Latin | 1511532 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 384657 | |
| S | 267692 | |
| U | 267691 | |
| M | 265497 | |
| T | 126351 | 8.4% |
| E | 119160 | 7.9% |
| e | 17152 | 1.1% |
| s | 14380 | 1.0% |
| p | 7481 | 0.5% |
| r | 7481 | 0.5% |
| Other values (13) | 33990 | 2.2% |
Common
| Value | Count | Frequency (%) |
| 1 | 339055 | |
| 0 | 282387 | |
| 4 | 250940 | |
| 6 | 201528 | |
| 3 | 187578 | |
| 2 | 175087 | |
| 170063 | ||
| 5 | 167513 | |
| 9 | 130924 | 5.8% |
| 7 | 128542 | 5.7% |
| Other values (3) | 225528 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3770677 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 384657 | 10.2% |
| 1 | 339055 | 9.0% |
| 0 | 282387 | 7.5% |
| S | 267692 | 7.1% |
| U | 267691 | 7.1% |
| M | 265497 | 7.0% |
| 4 | 250940 | 6.7% |
| 6 | 201528 | 5.3% |
| 3 | 187578 | 5.0% |
| 2 | 175087 | 4.6% |
| Other values (26) | 1148565 |
recordNumber
Text
Missing 
| Distinct | 103014 |
|---|---|
| Distinct (%) | 65.8% |
| Missing | 181774 |
| Missing (%) | 53.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 87 |
|---|---|
| Median length | 53 |
| Mean length | 8.258850038 |
| Min length | 1 |
Unique
| Unique | 67354 ? |
|---|---|
| Unique (%) | 43.0% |
Sample
| 1st row | T548-A9-TW19 |
|---|---|
| 2nd row | BMOO-09792 |
| 3rd row | JC3629 |
| 4th row | 707 |
| 5th row | mbio988 |
| Value | Count | Frequency (%) |
| blz | 5369 | 2.8% |
| d&ml | 4442 | 2.4% |
| 1572 | 0.8% | |
| tag | 1342 | 0.7% |
| tree | 1342 | 0.7% |
| flmoo | 1323 | 0.7% |
| blb | 1220 | 0.6% |
| sms | 1216 | 0.6% |
| bah | 991 | 0.5% |
| tob | 838 | 0.4% |
| Other values (93558) | 168768 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 122839 | 9.5% |
| 2 | 92741 | 7.2% |
| 0 | 89293 | 6.9% |
| 3 | 72397 | 5.6% |
| - | 60896 | 4.7% |
| 5 | 57984 | 4.5% |
| 4 | 57632 | 4.5% |
| 6 | 53757 | 4.2% |
| 8 | 52829 | 4.1% |
| 7 | 52283 | 4.0% |
| Other values (66) | 581230 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 701841 | |
| Uppercase Letter | 422285 | |
| Dash Punctuation | 60911 | 4.7% |
| Lowercase Letter | 41313 | 3.2% |
| Space Separator | 31757 | 2.5% |
| Connector Punctuation | 19948 | 1.5% |
| Other Punctuation | 11784 | 0.9% |
| Close Punctuation | 2021 | 0.2% |
| Open Punctuation | 2021 | 0.2% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 37587 | 8.9% |
| B | 37015 | 8.8% |
| O | 31746 | 7.5% |
| M | 31446 | 7.4% |
| S | 27844 | 6.6% |
| A | 26271 | 6.2% |
| R | 24686 | 5.8% |
| T | 22639 | 5.4% |
| L | 20614 | 4.9% |
| E | 18908 | 4.5% |
| Other values (16) | 143529 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5200 | |
| i | 4273 | |
| a | 4129 | |
| b | 4010 | |
| o | 4008 | |
| r | 3394 | |
| m | 3338 | |
| l | 2849 | |
| s | 1665 | 4.0% |
| v | 1562 | 3.8% |
| Other values (15) | 6885 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 122839 | |
| 2 | 92741 | |
| 0 | 89293 | |
| 3 | 72397 | |
| 5 | 57984 | |
| 4 | 57632 | |
| 6 | 53757 | |
| 8 | 52829 | |
| 7 | 52283 | |
| 9 | 50086 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4691 | |
| & | 4584 | |
| # | 1514 | 12.8% |
| . | 921 | 7.8% |
| / | 49 | 0.4% |
| ? | 22 | 0.2% |
| : | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 60896 | |
| – | 15 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2007 | |
| ] | 14 | 0.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2007 | |
| [ | 14 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 31757 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19948 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 830283 | |
| Latin | 463598 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 37587 | 8.1% |
| B | 37015 | 8.0% |
| O | 31746 | 6.8% |
| M | 31446 | 6.8% |
| S | 27844 | 6.0% |
| A | 26271 | 5.7% |
| R | 24686 | 5.3% |
| T | 22639 | 4.9% |
| L | 20614 | 4.4% |
| E | 18908 | 4.1% |
| Other values (41) | 184842 |
Common
| Value | Count | Frequency (%) |
| 1 | 122839 | |
| 2 | 92741 | |
| 0 | 89293 | |
| 3 | 72397 | |
| - | 60896 | |
| 5 | 57984 | |
| 4 | 57632 | |
| 6 | 53757 | |
| 8 | 52829 | |
| 7 | 52283 | |
| Other values (15) | 117632 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1293866 | |
| Punctuation | 15 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 122839 | 9.5% |
| 2 | 92741 | 7.2% |
| 0 | 89293 | 6.9% |
| 3 | 72397 | 5.6% |
| - | 60896 | 4.7% |
| 5 | 57984 | 4.5% |
| 4 | 57632 | 4.5% |
| 6 | 53757 | 4.2% |
| 8 | 52829 | 4.1% |
| 7 | 52283 | 4.0% |
| Other values (65) | 581215 |
Punctuation
| Value | Count | Frequency (%) |
| – | 15 |
recordedBy
Text
Missing 
| Distinct | 8091 |
|---|---|
| Distinct (%) | 3.0% |
| Missing | 70194 |
| Missing (%) | 20.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 161 |
|---|---|
| Median length | 107 |
| Mean length | 24.1533555 |
| Min length | 1 |
Unique
| Unique | 911 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | R. Wielgus |
|---|---|
| 2nd row | R. Vrijenhoek |
| 3rd row | S. McPherson |
| 4th row | K. Crandall, H. Robinson, J. Buhay & A. Toon |
| 5th row | Tibet-MacArthur, D. A. Bell, V. A. Funk, S. Ge, Y. Meng, Z. Nie, R. Ree, J. Wen, J. Yue & W. Zuo |
| Value | Count | Frequency (%) |
| 115581 | 8.9% | |
| m | 71033 | 5.5% |
| j | 69003 | 5.3% |
| r | 47243 | 3.6% |
| d | 44057 | 3.4% |
| c | 43631 | 3.4% |
| s | 40837 | 3.1% |
| k | 35450 | 2.7% |
| l | 29158 | 2.2% |
| a | 28418 | 2.2% |
| Other values (5514) | 776756 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1032921 | ||
| . | 565462 | 8.7% |
| e | 432466 | 6.7% |
| a | 360216 | 5.6% |
| n | 295808 | 4.6% |
| r | 285836 | 4.4% |
| i | 278879 | 4.3% |
| l | 261365 | 4.0% |
| o | 259195 | 4.0% |
| t | 196007 | 3.0% |
| Other values (73) | 2510886 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3378651 | |
| Uppercase Letter | 1203503 | 18.6% |
| Space Separator | 1032921 | 15.9% |
| Other Punctuation | 838894 | 12.9% |
| Dash Punctuation | 13931 | 0.2% |
| Decimal Number | 8809 | 0.1% |
| Close Punctuation | 1221 | < 0.1% |
| Open Punctuation | 1111 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 432466 | |
| a | 360216 | |
| n | 295808 | |
| r | 285836 | |
| i | 278879 | 8.3% |
| l | 261365 | 7.7% |
| o | 259195 | 7.7% |
| t | 196007 | 5.8% |
| s | 189168 | 5.6% |
| u | 120911 | 3.6% |
| Other values (27) | 698800 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 124673 | 10.4% |
| S | 91501 | 7.6% |
| C | 84918 | 7.1% |
| B | 82920 | 6.9% |
| R | 80213 | 6.7% |
| J | 77130 | 6.4% |
| P | 76391 | 6.3% |
| D | 68133 | 5.7% |
| L | 65369 | 5.4% |
| W | 57293 | 4.8% |
| Other values (17) | 394962 |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2232 | |
| 1 | 2078 | |
| 2 | 2016 | |
| 0 | 1932 | |
| 8 | 370 | 4.2% |
| 6 | 95 | 1.1% |
| 4 | 84 | 1.0% |
| 3 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 565462 | |
| , | 155134 | 18.5% |
| & | 115577 | 13.8% |
| / | 2047 | 0.2% |
| ' | 674 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1011 | |
| ] | 210 | 17.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 901 | |
| [ | 210 | 18.9% |
Space Separator
| Value | Count | Frequency (%) |
| 1032921 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13931 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4582154 | |
| Common | 1896887 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 432466 | 9.4% |
| a | 360216 | 7.9% |
| n | 295808 | 6.5% |
| r | 285836 | 6.2% |
| i | 278879 | 6.1% |
| l | 261365 | 5.7% |
| o | 259195 | 5.7% |
| t | 196007 | 4.3% |
| s | 189168 | 4.1% |
| M | 124673 | 2.7% |
| Other values (54) | 1898541 |
Common
| Value | Count | Frequency (%) |
| 1032921 | ||
| . | 565462 | |
| , | 155134 | 8.2% |
| & | 115577 | 6.1% |
| - | 13931 | 0.7% |
| 9 | 2232 | 0.1% |
| 1 | 2078 | 0.1% |
| / | 2047 | 0.1% |
| 2 | 2016 | 0.1% |
| 0 | 1932 | 0.1% |
| Other values (9) | 3557 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6477050 | |
| None | 1991 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1032921 | ||
| . | 565462 | 8.7% |
| e | 432466 | 6.7% |
| a | 360216 | 5.6% |
| n | 295808 | 4.6% |
| r | 285836 | 4.4% |
| i | 278879 | 4.3% |
| l | 261365 | 4.0% |
| o | 259195 | 4.0% |
| t | 196007 | 3.0% |
| Other values (61) | 2508895 |
None
| Value | Count | Frequency (%) |
| í | 1006 | |
| é | 487 | |
| ö | 157 | 7.9% |
| á | 138 | 6.9% |
| ó | 97 | 4.9% |
| Ç | 33 | 1.7% |
| ı | 33 | 1.7% |
| ñ | 21 | 1.1% |
| ú | 12 | 0.6% |
| ü | 3 | 0.2% |
| Other values (2) | 4 | 0.2% |
individualCount
Text
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 39392 |
| Missing (%) | 11.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.00012707 |
| Min length | 1 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 295008 | |
| 0 | 2661 | 0.9% |
| 4 | 440 | 0.1% |
| 2 | 364 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 226 | 0.1% |
| 10 | 26 | < 0.1% |
| 6 | 20 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| Other values (9) | 14 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 295042 | |
| 0 | 2691 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 369 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 299086 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 295042 | |
| 0 | 2691 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 369 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 299086 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 295042 | |
| 0 | 2691 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 369 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 299086 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 295042 | |
| 0 | 2691 | 0.9% |
| 4 | 442 | 0.1% |
| 2 | 369 | 0.1% |
| 5 | 280 | 0.1% |
| 3 | 229 | 0.1% |
| 6 | 21 | < 0.1% |
| 8 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 3 | < 0.1% |
sex
Text
Missing 
| Distinct | 58 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 176515 |
| Missing (%) | 52.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 7 |
| Mean length | 6.105907056 |
| Min length | 4 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unknown |
|---|---|
| 2nd row | Unknown |
| 3rd row | Unknown |
| 4th row | Male |
| 5th row | Unknown |
| Value | Count | Frequency (%) |
| unknown | 88174 | |
| male | 41609 | |
| female | 32812 | 20.1% |
| worker | 489 | 0.3% |
| sex | 178 | 0.1% |
| hermaphrodite | 73 | < 0.1% |
| 62 | < 0.1% | |
| unable | 24 | < 0.1% |
| to | 24 | < 0.1% |
| determine | 24 | < 0.1% |
| Other values (6) | 57 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 264592 | |
| e | 108181 | |
| o | 88771 | 9.0% |
| k | 88663 | 9.0% |
| w | 88663 | 9.0% |
| U | 87723 | 8.9% |
| a | 74566 | 7.5% |
| l | 74493 | 7.5% |
| m | 37336 | 3.8% |
| M | 37219 | 3.8% |
| Other values (18) | 38492 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 831664 | |
| Uppercase Letter | 154048 | 15.6% |
| Space Separator | 1601 | 0.2% |
| Other Punctuation | 1386 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 264592 | |
| e | 108181 | |
| o | 88771 | 10.7% |
| k | 88663 | 10.7% |
| w | 88663 | 10.7% |
| a | 74566 | 9.0% |
| l | 74493 | 9.0% |
| m | 37336 | 4.5% |
| f | 3897 | 0.5% |
| r | 1159 | 0.1% |
| Other values (9) | 1343 | 0.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 87723 | |
| M | 37219 | |
| F | 28928 | 18.8% |
| S | 167 | 0.1% |
| P | 11 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1380 | |
| ? | 4 | 0.3% |
| / | 2 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1601 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 985712 | |
| Common | 2987 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 264592 | |
| e | 108181 | |
| o | 88771 | 9.0% |
| k | 88663 | 9.0% |
| w | 88663 | 9.0% |
| U | 87723 | 8.9% |
| a | 74566 | 7.6% |
| l | 74493 | 7.6% |
| m | 37336 | 3.8% |
| M | 37219 | 3.8% |
| Other values (14) | 35505 | 3.6% |
Common
| Value | Count | Frequency (%) |
| 1601 | ||
| ; | 1380 | |
| ? | 4 | 0.1% |
| / | 2 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 988699 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 264592 | |
| e | 108181 | |
| o | 88771 | 9.0% |
| k | 88663 | 9.0% |
| w | 88663 | 9.0% |
| U | 87723 | 8.9% |
| a | 74566 | 7.5% |
| l | 74493 | 7.5% |
| m | 37336 | 3.8% |
| M | 37219 | 3.8% |
| Other values (18) | 38492 | 3.9% |
lifeStage
Text
Missing 
| Distinct | 170 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 205051 |
| Missing (%) | 60.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 37 |
|---|---|
| Median length | 5 |
| Mean length | 5.180614593 |
| Min length | 1 |
Unique
| Unique | 33 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Adult |
|---|---|
| 2nd row | Adult |
| 3rd row | Adult |
| 4th row | Adult |
| 5th row | Adult |
| Value | Count | Frequency (%) |
| adult | 122000 | |
| juvenile | 3367 | 2.5% |
| larva | 1575 | 1.2% |
| ii | 1499 | 1.1% |
| flowering | 1064 | 0.8% |
| i | 883 | 0.7% |
| unknown | 548 | 0.4% |
| subadult | 538 | 0.4% |
| sterile | 365 | 0.3% |
| eft | 308 | 0.2% |
| Other values (90) | 2764 | 2.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 129264 | |
| u | 127768 | |
| t | 124368 | |
| d | 122898 | |
| A | 121282 | |
| e | 10362 | 1.5% |
| n | 6940 | 1.0% |
| a | 6310 | 0.9% |
| i | 5972 | 0.9% |
| v | 5446 | 0.8% |
| Other values (42) | 30427 | 4.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 556509 | |
| Uppercase Letter | 131353 | 19.0% |
| Other Punctuation | 1566 | 0.2% |
| Space Separator | 1522 | 0.2% |
| Dash Punctuation | 65 | < 0.1% |
| Open Punctuation | 11 | < 0.1% |
| Close Punctuation | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 129264 | |
| u | 127768 | |
| t | 124368 | |
| d | 122898 | |
| e | 10362 | 1.9% |
| n | 6940 | 1.2% |
| a | 6310 | 1.1% |
| i | 5972 | 1.1% |
| v | 5446 | 1.0% |
| r | 4535 | 0.8% |
| Other values (15) | 12646 | 2.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 121282 | |
| I | 4215 | 3.2% |
| J | 1708 | 1.3% |
| F | 1155 | 0.9% |
| S | 895 | 0.7% |
| U | 548 | 0.4% |
| L | 513 | 0.4% |
| E | 357 | 0.3% |
| P | 245 | 0.2% |
| N | 128 | 0.1% |
| Other values (9) | 307 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 1498 | |
| ? | 37 | 2.4% |
| ' | 28 | 1.8% |
| / | 3 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 1522 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 65 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 11 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 11 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 687862 | |
| Common | 3175 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 129264 | |
| u | 127768 | |
| t | 124368 | |
| d | 122898 | |
| A | 121282 | |
| e | 10362 | 1.5% |
| n | 6940 | 1.0% |
| a | 6310 | 0.9% |
| i | 5972 | 0.9% |
| v | 5446 | 0.8% |
| Other values (34) | 27252 | 4.0% |
Common
| Value | Count | Frequency (%) |
| 1522 | ||
| ; | 1498 | |
| - | 65 | 2.0% |
| ? | 37 | 1.2% |
| ' | 28 | 0.9% |
| ( | 11 | 0.3% |
| ) | 11 | 0.3% |
| / | 3 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 691009 | |
| None | 28 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 129264 | |
| u | 127768 | |
| t | 124368 | |
| d | 122898 | |
| A | 121282 | |
| e | 10362 | 1.5% |
| n | 6940 | 1.0% |
| a | 6310 | 0.9% |
| i | 5972 | 0.9% |
| v | 5446 | 0.8% |
| Other values (41) | 30399 | 4.4% |
None
| Value | Count | Frequency (%) |
| ü | 28 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | present |
|---|---|
| 2nd row | present |
| 3rd row | present |
| 4th row | present |
| 5th row | present |
| Value | Count | Frequency (%) |
| present | 338440 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 676880 | |
| p | 338440 | |
| r | 338440 | |
| s | 338440 | |
| n | 338440 | |
| t | 338440 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2369080 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 676880 | |
| p | 338440 | |
| r | 338440 | |
| s | 338440 | |
| n | 338440 | |
| t | 338440 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2369080 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 676880 | |
| p | 338440 | |
| r | 338440 | |
| s | 338440 | |
| n | 338440 | |
| t | 338440 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2369080 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 676880 | |
| p | 338440 | |
| r | 338440 | |
| s | 338440 | |
| n | 338440 | |
| t | 338440 |
preparations
Text
Missing 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 251349 |
| Missing (%) | 74.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 142 |
|---|---|
| Median length | 6 |
| Mean length | 6.192109403 |
| Min length | 4 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Frozen |
|---|---|
| 2nd row | Frozen |
| 3rd row | Frozen |
| 4th row | Frozen |
| 5th row | Frozen |
| Value | Count | Frequency (%) |
| frozen | 72657 | |
| vial | 6702 | 7.3% |
| ethanol | 4922 | 5.4% |
| wet | 2271 | 2.5% |
| lot | 2271 | 2.5% |
| drained | 1063 | 1.2% |
| photograph | 626 | 0.7% |
| biorepository | 456 | 0.5% |
| alcohol | 198 | 0.2% |
| 148 | 0.2% | |
| Other values (11) | 295 | 0.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 83011 | |
| n | 78739 | |
| e | 76503 | |
| r | 75316 | |
| z | 72657 | |
| F | 72306 | |
| l | 14346 | 2.7% |
| a | 13325 | 2.5% |
| t | 10601 | 2.0% |
| i | 8777 | 1.6% |
| Other values (37) | 33696 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 448059 | |
| Uppercase Letter | 84989 | 15.8% |
| Space Separator | 4518 | 0.8% |
| Other Punctuation | 837 | 0.2% |
| Decimal Number | 296 | 0.1% |
| Open Punctuation | 198 | < 0.1% |
| Close Punctuation | 198 | < 0.1% |
| Dash Punctuation | 182 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 83011 | |
| n | 78739 | |
| e | 76503 | |
| r | 75316 | |
| z | 72657 | |
| l | 14346 | 3.2% |
| a | 13325 | 3.0% |
| t | 10601 | 2.4% |
| i | 8777 | 2.0% |
| h | 6372 | 1.4% |
| Other values (13) | 8412 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 72306 | |
| E | 4957 | 5.8% |
| V | 3865 | 4.5% |
| W | 2256 | 2.7% |
| P | 626 | 0.7% |
| B | 456 | 0.5% |
| A | 243 | 0.3% |
| D | 73 | 0.1% |
| L | 49 | 0.1% |
| S | 37 | < 0.1% |
| Other values (5) | 121 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 641 | |
| % | 148 | 17.7% |
| ' | 48 | 5.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 148 | |
| 5 | 148 |
Space Separator
| Value | Count | Frequency (%) |
| 4518 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 198 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 198 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 182 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 533048 | |
| Common | 6229 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 83011 | |
| n | 78739 | |
| e | 76503 | |
| r | 75316 | |
| z | 72657 | |
| F | 72306 | |
| l | 14346 | 2.7% |
| a | 13325 | 2.5% |
| t | 10601 | 2.0% |
| i | 8777 | 1.6% |
| Other values (28) | 27467 | 5.2% |
Common
| Value | Count | Frequency (%) |
| 4518 | ||
| ; | 641 | 10.3% |
| ( | 198 | 3.2% |
| ) | 198 | 3.2% |
| - | 182 | 2.9% |
| 9 | 148 | 2.4% |
| % | 148 | 2.4% |
| 5 | 148 | 2.4% |
| ' | 48 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 539277 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 83011 | |
| n | 78739 | |
| e | 76503 | |
| r | 75316 | |
| z | 72657 | |
| F | 72306 | |
| l | 14346 | 2.7% |
| a | 13325 | 2.5% |
| t | 10601 | 2.0% |
| i | 8777 | 1.6% |
| Other values (37) | 33696 |
disposition
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.3834919 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | in collection |
|---|---|
| 2nd row | in collection |
| 3rd row | in collection |
| 4th row | in collection |
| 5th row | in collection |
| Value | Count | Frequency (%) |
| in | 298638 | |
| collection | 298638 | |
| consumed | 38038 | 6.0% |
| yes | 943 | 0.1% |
| no | 821 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 636135 | |
| o | 636135 | |
| c | 635314 | |
| i | 597276 | |
| l | 597276 | |
| e | 337619 | |
| 298638 | ||
| t | 298638 | |
| s | 38981 | 0.9% |
| u | 38038 | 0.9% |
| Other values (3) | 77019 | 1.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3892431 | |
| Space Separator | 298638 | 7.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 636135 | |
| o | 636135 | |
| c | 635314 | |
| i | 597276 | |
| l | 597276 | |
| e | 337619 | |
| t | 298638 | |
| s | 38981 | 1.0% |
| u | 38038 | 1.0% |
| m | 38038 | 1.0% |
| Other values (2) | 38981 | 1.0% |
Space Separator
| Value | Count | Frequency (%) |
| 298638 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3892431 | |
| Common | 298638 | 7.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 636135 | |
| o | 636135 | |
| c | 635314 | |
| i | 597276 | |
| l | 597276 | |
| e | 337619 | |
| t | 298638 | |
| s | 38981 | 1.0% |
| u | 38038 | 1.0% |
| m | 38038 | 1.0% |
| Other values (2) | 38981 | 1.0% |
Common
| Value | Count | Frequency (%) |
| 298638 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4191069 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 636135 | |
| o | 636135 | |
| c | 635314 | |
| i | 597276 | |
| l | 597276 | |
| e | 337619 | |
| 298638 | ||
| t | 298638 | |
| s | 38981 | 0.9% |
| u | 38038 | 0.9% |
| Other values (3) | 77019 | 1.8% |
associatedMedia
Text
Missing 
| Distinct | 11559 |
|---|---|
| Distinct (%) | 79.4% |
| Missing | 323875 |
| Missing (%) | 95.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 369 |
|---|---|
| Median length | 49 |
| Mean length | 52.45417096 |
| Min length | 49 |
Unique
| Unique | 9279 ? |
|---|---|
| Unique (%) | 63.7% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=15102863 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=15392053 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=15102609 |
| 4th row | https://collections.nmnh.si.edu/media/?i=15102164 |
| 5th row | https://collections.nmnh.si.edu/media/?i=15100806 |
| Value | Count | Frequency (%) |
| https://collections.nmnh.si.edu/media/?i=16192884 | 83 | 0.4% |
| https://collections.nmnh.si.edu/media/?i=14723169 | 38 | 0.2% |
| 14723158 | 38 | 0.2% |
| https://collections.nmnh.si.edu/media/?i=13853473 | 34 | 0.2% |
| https://collections.nmnh.si.edu/media/?i=14322468 | 30 | 0.2% |
| https://collections.nmnh.si.edu/media/?i=13822124 | 28 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=13812183 | 28 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=13812175 | 28 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=13812196 | 24 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=13858205 | 22 | 0.1% |
| Other values (14612) | 19243 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 58260 | 7.6% |
| / | 58260 | 7.6% |
| t | 43695 | 5.7% |
| s | 43695 | 5.7% |
| . | 43695 | 5.7% |
| n | 43695 | 5.7% |
| e | 43695 | 5.7% |
| 1 | 38923 | 5.1% |
| d | 29130 | 3.8% |
| m | 29130 | 3.8% |
| Other values (21) | 331817 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 451515 | |
| Decimal Number | 156768 | 20.5% |
| Other Punctuation | 136116 | 17.8% |
| Math Symbol | 14565 | 1.9% |
| Space Separator | 5031 | 0.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 58260 | |
| t | 43695 | |
| s | 43695 | |
| n | 43695 | |
| e | 43695 | |
| d | 29130 | 6.5% |
| m | 29130 | 6.5% |
| h | 29130 | 6.5% |
| o | 29130 | 6.5% |
| c | 29130 | 6.5% |
| Other values (4) | 72825 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 38923 | |
| 5 | 21259 | |
| 4 | 18285 | |
| 2 | 13953 | 8.9% |
| 0 | 13856 | 8.8% |
| 3 | 12557 | 8.0% |
| 9 | 12178 | 7.8% |
| 7 | 10019 | 6.4% |
| 8 | 8006 | 5.1% |
| 6 | 7732 | 4.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 58260 | |
| . | 43695 | |
| ? | 14565 | 10.7% |
| : | 14565 | 10.7% |
| ; | 5031 | 3.7% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 14565 |
Space Separator
| Value | Count | Frequency (%) |
| 5031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 451515 | |
| Common | 312480 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 58260 | |
| . | 43695 | |
| 1 | 38923 | |
| 5 | 21259 | 6.8% |
| 4 | 18285 | 5.9% |
| = | 14565 | 4.7% |
| ? | 14565 | 4.7% |
| : | 14565 | 4.7% |
| 2 | 13953 | 4.5% |
| 0 | 13856 | 4.4% |
| Other values (7) | 60554 |
Latin
| Value | Count | Frequency (%) |
| i | 58260 | |
| t | 43695 | |
| s | 43695 | |
| n | 43695 | |
| e | 43695 | |
| d | 29130 | 6.5% |
| m | 29130 | 6.5% |
| h | 29130 | 6.5% |
| o | 29130 | 6.5% |
| c | 29130 | 6.5% |
| Other values (4) | 72825 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 763995 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 58260 | 7.6% |
| / | 58260 | 7.6% |
| t | 43695 | 5.7% |
| s | 43695 | 5.7% |
| . | 43695 | 5.7% |
| n | 43695 | 5.7% |
| e | 43695 | 5.7% |
| 1 | 38923 | 5.1% |
| d | 29130 | 3.8% |
| m | 29130 | 3.8% |
| Other values (21) | 331817 |
Missing 
| Distinct | 25157 |
|---|---|
| Distinct (%) | 76.9% |
| Missing | 305730 |
| Missing (%) | 90.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 228 |
|---|---|
| Median length | 8 |
| Mean length | 11.35741363 |
| Min length | 8 |
Unique
| Unique | 17604 ? |
|---|---|
| Unique (%) | 53.8% |
Sample
| 1st row | MW204230; MW124559 |
|---|---|
| 2nd row | MW982336 |
| 3rd row | MF785606; MF785913 |
| 4th row | MN344605 |
| 5th row | JQ840329 |
| Value | Count | Frequency (%) |
| prjna345052 | 17 | < 0.1% |
| prjna396973 | 12 | < 0.1% |
| mn345496 | 2 | < 0.1% |
| mw982402 | 2 | < 0.1% |
| mg968118 | 2 | < 0.1% |
| mn344953 | 2 | < 0.1% |
| mw983235 | 2 | < 0.1% |
| mw983078 | 2 | < 0.1% |
| mn345717 | 2 | < 0.1% |
| mw277973 | 2 | < 0.1% |
| Other values (35813) | 43421 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 34043 | 9.2% |
| 3 | 32605 | 8.8% |
| 4 | 30970 | 8.3% |
| 9 | 27689 | 7.5% |
| M | 27269 | 7.3% |
| 2 | 27091 | 7.3% |
| 7 | 23954 | 6.4% |
| 0 | 23205 | 6.2% |
| 5 | 21376 | 5.8% |
| 1 | 21132 | 5.7% |
| Other values (25) | 102167 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 262170 | |
| Uppercase Letter | 87817 | 23.6% |
| Other Punctuation | 10758 | 2.9% |
| Space Separator | 10756 | 2.9% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 27269 | |
| W | 9704 | 11.1% |
| O | 9366 | 10.7% |
| Q | 7274 | 8.3% |
| N | 6102 | 6.9% |
| F | 4776 | 5.4% |
| J | 4067 | 4.6% |
| H | 3584 | 4.1% |
| K | 3130 | 3.6% |
| P | 2662 | 3.0% |
| Other values (11) | 9883 | 11.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 34043 | |
| 3 | 32605 | |
| 4 | 30970 | |
| 9 | 27689 | |
| 2 | 27091 | |
| 7 | 23954 | |
| 0 | 23205 | |
| 5 | 21376 | |
| 1 | 21132 | |
| 6 | 20105 |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 10756 | |
| / | 1 | < 0.1% |
| . | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 10756 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 283684 | |
| Latin | 87817 | 23.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 27269 | |
| W | 9704 | 11.1% |
| O | 9366 | 10.7% |
| Q | 7274 | 8.3% |
| N | 6102 | 6.9% |
| F | 4776 | 5.4% |
| J | 4067 | 4.6% |
| H | 3584 | 4.1% |
| K | 3130 | 3.6% |
| P | 2662 | 3.0% |
| Other values (11) | 9883 | 11.3% |
Common
| Value | Count | Frequency (%) |
| 8 | 34043 | |
| 3 | 32605 | |
| 4 | 30970 | |
| 9 | 27689 | |
| 2 | 27091 | |
| 7 | 23954 | |
| 0 | 23205 | |
| 5 | 21376 | |
| 1 | 21132 | |
| 6 | 20105 | |
| Other values (4) | 21514 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 371501 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 34043 | 9.2% |
| 3 | 32605 | 8.8% |
| 4 | 30970 | 8.3% |
| 9 | 27689 | 7.5% |
| M | 27269 | 7.3% |
| 2 | 27091 | 7.3% |
| 7 | 23954 | 6.4% |
| 0 | 23205 | 6.2% |
| 5 | 21376 | 5.8% |
| 1 | 21132 | 5.7% |
| Other values (25) | 102167 |
Missing 
| Distinct | 28700 |
|---|---|
| Distinct (%) | 19.8% |
| Missing | 193737 |
| Missing (%) | 57.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 120818 |
|---|---|
| Median length | 61 |
| Mean length | 79.26073406 |
| Min length | 1 |
Unique
| Unique | 19961 ? |
|---|---|
| Unique (%) | 13.8% |
Sample
| 1st row | One leg removed for genetic sampling while on loan to GUELPH. |
|---|---|
| 2nd row | Order: 10948; Box Number: MBARI_0136: Box Position: B/4 |
| 3rd row | One leg removed for genetic sampling while on loan to GUELPH. |
| 4th row | Originally cataloged as an image record because field notes indicated there was a photovoucher for the specimen. When the images were cataloged in early 2020, no photos were found for this specimen so the record was changed to a Genetic Sample (DNA) with no voucher. |
| 5th row | Entire tissue sample consumed for DNA extraction. Specimen voucher located at Museum National d'Histoire Naturelle, Paris. |
| Value | Count | Frequency (%) |
| for | 114843 | 6.1% |
| on | 113412 | 6.0% |
| to | 111958 | 5.9% |
| genetic | 110770 | 5.9% |
| while | 109786 | 5.8% |
| sampling | 108913 | 5.8% |
| loan | 108870 | 5.8% |
| removed | 108857 | 5.8% |
| guelph | 108797 | 5.8% |
| one | 105620 | 5.6% |
| Other values (39978) | 787611 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1726121 | 15.0% | |
| e | 1095754 | 9.6% |
| o | 790058 | 6.9% |
| n | 707325 | 6.2% |
| l | 585110 | 5.1% |
| i | 570424 | 5.0% |
| a | 443981 | 3.9% |
| t | 412504 | 3.6% |
| r | 412405 | 3.6% |
| g | 358988 | 3.1% |
| Other values (106) | 4366596 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7289984 | |
| Space Separator | 1726121 | 15.0% |
| Uppercase Letter | 1411685 | 12.3% |
| Decimal Number | 488632 | 4.3% |
| Other Punctuation | 397122 | 3.5% |
| Control | 85346 | 0.7% |
| Dash Punctuation | 28244 | 0.2% |
| Math Symbol | 18258 | 0.2% |
| Connector Punctuation | 15626 | 0.1% |
| Open Punctuation | 4114 | < 0.1% |
| Other values (4) | 4134 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1095754 | |
| o | 790058 | |
| n | 707325 | |
| l | 585110 | 8.0% |
| i | 570424 | 7.8% |
| a | 443981 | 6.1% |
| t | 412504 | 5.7% |
| r | 412405 | 5.7% |
| g | 358988 | 4.9% |
| m | 292695 | 4.0% |
| Other values (31) | 1620740 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 144183 | |
| O | 140094 | |
| G | 139730 | |
| H | 124624 | |
| U | 124299 | |
| E | 121592 | |
| L | 117111 | 8.3% |
| B | 74857 | 5.3% |
| N | 66333 | 4.7% |
| M | 62836 | 4.5% |
| Other values (19) | 296026 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 176037 | |
| : | 84185 | |
| ; | 72201 | |
| , | 34683 | 8.7% |
| / | 19654 | 4.9% |
| ' | 3148 | 0.8% |
| " | 3113 | 0.8% |
| # | 2394 | 0.6% |
| & | 1017 | 0.3% |
| ? | 575 | 0.1% |
| Other values (4) | 115 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 88589 | |
| 0 | 74354 | |
| 2 | 52624 | |
| 9 | 47648 | |
| 3 | 42145 | |
| 4 | 38220 | |
| 5 | 37436 | |
| 6 | 36920 | |
| 8 | 36777 | |
| 7 | 33919 | 6.9% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 17759 | |
| = | 399 | 2.2% |
| + | 57 | 0.3% |
| ~ | 17 | 0.1% |
| < | 16 | 0.1% |
| > | 10 | 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3436 | |
| ] | 667 | 16.2% |
| } | 6 | 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 14 | |
| ♀ | 5 | 21.7% |
| ♂ | 4 | 17.4% |
Control
| Value | Count | Frequency (%) |
| 84899 | ||
| 447 | 0.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 28001 | |
| — | 243 | 0.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3444 | |
| [ | 670 | 16.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1726121 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15626 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 1 |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8701657 | |
| Common | 2767609 | 24.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1095754 | 12.6% |
| o | 790058 | 9.1% |
| n | 707325 | 8.1% |
| l | 585110 | 6.7% |
| i | 570424 | 6.6% |
| a | 443981 | 5.1% |
| t | 412504 | 4.7% |
| r | 412405 | 4.7% |
| g | 358988 | 4.1% |
| m | 292695 | 3.4% |
| Other values (59) | 3032413 |
Common
| Value | Count | Frequency (%) |
| 1726121 | ||
| . | 176037 | 6.4% |
| 1 | 88589 | 3.2% |
| 84899 | 3.1% | |
| : | 84185 | 3.0% |
| 0 | 74354 | 2.7% |
| ; | 72201 | 2.6% |
| 2 | 52624 | 1.9% |
| 9 | 47648 | 1.7% |
| 3 | 42145 | 1.5% |
| Other values (37) | 318806 | 11.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11468894 | |
| Punctuation | 243 | < 0.1% |
| None | 120 | < 0.1% |
| Misc Symbols | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1726121 | 15.1% | |
| e | 1095754 | 9.6% |
| o | 790058 | 6.9% |
| n | 707325 | 6.2% |
| l | 585110 | 5.1% |
| i | 570424 | 5.0% |
| a | 443981 | 3.9% |
| t | 412504 | 3.6% |
| r | 412405 | 3.6% |
| g | 358988 | 3.1% |
| Other values (81) | 4366224 |
Punctuation
| Value | Count | Frequency (%) |
| — | 243 |
None
| Value | Count | Frequency (%) |
| é | 24 | |
| ã | 15 | |
| ° | 14 | |
| í | 14 | |
| µ | 12 | |
| ó | 9 | 7.5% |
| Î | 4 | 3.3% |
| ç | 4 | 3.3% |
| á | 4 | 3.3% |
| ¿ | 3 | 2.5% |
| Other values (12) | 17 |
Misc Symbols
| Value | Count | Frequency (%) |
| ♀ | 5 | |
| ♂ | 4 |
organismID
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 12.5 |
| Mean length | 11.25 |
| Min length | 10 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 56°07'00"W |
|---|---|
| 2nd row | 2°27'29.11"W |
| 3rd row | 57°39'00"W |
| 4th row | 138deg49'41"E |
| Value | Count | Frequency (%) |
| 56°07'00"w | 1 | |
| 2°27'29.11"w | 1 | |
| 57°39'00"w | 1 | |
| 138deg49'41"e | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 4 | 8.9% |
| ' | 4 | 8.9% |
| " | 4 | 8.9% |
| 9 | 3 | 6.7% |
| ° | 3 | 6.7% |
| 7 | 3 | 6.7% |
| W | 3 | 6.7% |
| 2 | 3 | 6.7% |
| 4 | 2 | 4.4% |
| Other values (9) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26 | |
| Other Punctuation | 9 | 20.0% |
| Uppercase Letter | 4 | 8.9% |
| Other Symbol | 3 | 6.7% |
| Lowercase Letter | 3 | 6.7% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 4 | |
| 9 | 3 | |
| 7 | 3 | |
| 2 | 3 | |
| 4 | 2 | 7.7% |
| 3 | 2 | 7.7% |
| 5 | 2 | 7.7% |
| 8 | 1 | 3.8% |
| 6 | 1 | 3.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 4 | |
| " | 4 | |
| . | 1 | 11.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| d | 1 | |
| e | 1 | |
| g | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 3 | |
| E | 1 | 25.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 38 | |
| Latin | 7 | 15.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 4 | |
| ' | 4 | |
| " | 4 | |
| 9 | 3 | |
| ° | 3 | |
| 7 | 3 | |
| 2 | 3 | |
| 4 | 2 | 5.3% |
| 3 | 2 | 5.3% |
| Other values (4) | 5 |
Latin
| Value | Count | Frequency (%) |
| W | 3 | |
| d | 1 | 14.3% |
| e | 1 | 14.3% |
| g | 1 | 14.3% |
| E | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42 | |
| None | 3 | 6.7% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5 | |
| 1 | 4 | |
| ' | 4 | |
| " | 4 | |
| 9 | 3 | 7.1% |
| 7 | 3 | 7.1% |
| W | 3 | 7.1% |
| 2 | 3 | 7.1% |
| 4 | 2 | 4.8% |
| 3 | 2 | 4.8% |
| Other values (8) | 9 |
None
| Value | Count | Frequency (%) |
| ° | 3 |
organismName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338438 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 918.0 |
|---|---|
| 2nd row | 651.0 |
| Value | Count | Frequency (%) |
| 918.0 | 1 | |
| 651.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 | |
| 9 | 1 | |
| 8 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 | |
| Other Punctuation | 2 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 0 | 2 | |
| 9 | 1 | |
| 8 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 | |
| 9 | 1 | |
| 8 | 1 | |
| 6 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| . | 2 | |
| 0 | 2 | |
| 9 | 1 | |
| 8 | 1 | |
| 6 | 1 | |
| 5 | 1 |
organismScope
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338439 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 963.0 |
|---|
| Value | Count | Frequency (%) |
| 963.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| 0 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| . | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 9 | 1 | |
| 6 | 1 | |
| 3 | 1 | |
| . | 1 | |
| 0 | 1 |
materialSampleID
Text
Missing 
| Distinct | 253362 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 85078 |
| Missing (%) | 25.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 253362 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | AR5TC43 |
|---|---|
| 2nd row | AL2IC84 |
| 3rd row | AF9HI08 |
| 4th row | AD5JZ99 |
| 5th row | AE0OQ35 |
| Value | Count | Frequency (%) |
| ar5tc43 | 1 | < 0.1% |
| ae3rz90 | 1 | < 0.1% |
| am1rc30 | 1 | < 0.1% |
| al5lg46 | 1 | < 0.1% |
| an9jb30 | 1 | < 0.1% |
| af9hi08 | 1 | < 0.1% |
| ad5jz99 | 1 | < 0.1% |
| ae0oq35 | 1 | < 0.1% |
| an7hd65 | 1 | < 0.1% |
| ak3zy87 | 1 | < 0.1% |
| Other values (253352) | 253352 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 287914 | 16.2% |
| 7 | 79879 | 4.5% |
| 1 | 77955 | 4.4% |
| 2 | 77347 | 4.4% |
| 0 | 77102 | 4.3% |
| 4 | 76676 | 4.3% |
| 5 | 76415 | 4.3% |
| 3 | 76089 | 4.3% |
| 9 | 75065 | 4.2% |
| 6 | 73706 | 4.2% |
| Other values (26) | 795386 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1013448 | |
| Decimal Number | 760086 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 287914 | |
| O | 39979 | 3.9% |
| R | 39164 | 3.9% |
| K | 38882 | 3.8% |
| E | 36182 | 3.6% |
| C | 35417 | 3.5% |
| L | 34815 | 3.4% |
| H | 34226 | 3.4% |
| I | 33753 | 3.3% |
| F | 33691 | 3.3% |
| Other values (16) | 399425 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 79879 | |
| 1 | 77955 | |
| 2 | 77347 | |
| 0 | 77102 | |
| 4 | 76676 | |
| 5 | 76415 | |
| 3 | 76089 | |
| 9 | 75065 | |
| 6 | 73706 | |
| 8 | 69852 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1013448 | |
| Common | 760086 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 287914 | |
| O | 39979 | 3.9% |
| R | 39164 | 3.9% |
| K | 38882 | 3.8% |
| E | 36182 | 3.6% |
| C | 35417 | 3.5% |
| L | 34815 | 3.4% |
| H | 34226 | 3.4% |
| I | 33753 | 3.3% |
| F | 33691 | 3.3% |
| Other values (16) | 399425 |
Common
| Value | Count | Frequency (%) |
| 7 | 79879 | |
| 1 | 77955 | |
| 2 | 77347 | |
| 0 | 77102 | |
| 4 | 76676 | |
| 5 | 76415 | |
| 3 | 76089 | |
| 9 | 75065 | |
| 6 | 73706 | |
| 8 | 69852 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1773534 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 287914 | 16.2% |
| 7 | 79879 | 4.5% |
| 1 | 77955 | 4.4% |
| 2 | 77347 | 4.4% |
| 0 | 77102 | 4.3% |
| 4 | 76676 | 4.3% |
| 5 | 76415 | 4.3% |
| 3 | 76089 | 4.3% |
| 9 | 75065 | 4.2% |
| 6 | 73706 | 4.2% |
| Other values (26) | 795386 |
eventType
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.75 |
| Min length | 6 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 10.6925 |
|---|---|
| 2nd row | 5.55461 |
| 3rd row | 7.1633 |
| 4th row | 5.80961 |
| Value | Count | Frequency (%) |
| 10.6925 | 1 | |
| 5.55461 | 1 | |
| 7.1633 | 1 | |
| 5.80961 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | |
| . | 4 | |
| 6 | 4 | |
| 0 | 2 | 7.4% |
| 9 | 2 | 7.4% |
| 3 | 2 | 7.4% |
| 2 | 1 | 3.7% |
| 4 | 1 | 3.7% |
| 7 | 1 | 3.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23 | |
| Other Punctuation | 4 | 14.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | |
| 6 | 4 | |
| 0 | 2 | 8.7% |
| 9 | 2 | 8.7% |
| 3 | 2 | 8.7% |
| 2 | 1 | 4.3% |
| 4 | 1 | 4.3% |
| 7 | 1 | 4.3% |
| 8 | 1 | 4.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 27 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | |
| . | 4 | |
| 6 | 4 | |
| 0 | 2 | 7.4% |
| 9 | 2 | 7.4% |
| 3 | 2 | 7.4% |
| 2 | 1 | 3.7% |
| 4 | 1 | 3.7% |
| 7 | 1 | 3.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 27 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | 5 | |
| 1 | 4 | |
| . | 4 | |
| 6 | 4 | |
| 0 | 2 | 7.4% |
| 9 | 2 | 7.4% |
| 3 | 2 | 7.4% |
| 2 | 1 | 3.7% |
| 4 | 1 | 3.7% |
| 7 | 1 | 3.7% |
fieldNumber
Text
Missing 
| Distinct | 7070 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 267431 |
| Missing (%) | 79.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 43 |
| Mean length | 11.55291583 |
| Min length | 1 |
Unique
| Unique | 2667 ? |
|---|---|
| Unique (%) | 3.8% |
Sample
| 1st row | MBARI/T548 |
|---|---|
| 2nd row | MBIO/BIZ-231 |
| 3rd row | Moorea F-06-12 |
| 4th row | MBARI/T488 |
| 5th row | AL-4097 |
| Value | Count | Frequency (%) |
| cb | 3400 | 3.7% |
| moorea | 3156 | 3.5% |
| fp | 1216 | 1.3% |
| lrp | 1033 | 1.1% |
| bah | 991 | 1.1% |
| tob | 838 | 0.9% |
| cur | 813 | 0.9% |
| mbio/080611_minv_014 | 626 | 0.7% |
| dgs | 506 | 0.6% |
| sec18-07 | 504 | 0.6% |
| Other values (7241) | 78083 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 78674 | 9.6% |
| - | 70165 | 8.6% |
| 1 | 62347 | 7.6% |
| B | 45941 | 5.6% |
| 2 | 44017 | 5.4% |
| I | 35464 | 4.3% |
| M | 34420 | 4.2% |
| A | 34162 | 4.2% |
| 3 | 27081 | 3.3% |
| 8 | 21679 | 2.6% |
| Other values (62) | 366411 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 326117 | |
| Decimal Number | 325547 | |
| Dash Punctuation | 70165 | 8.6% |
| Lowercase Letter | 34360 | 4.2% |
| Other Punctuation | 26194 | 3.2% |
| Space Separator | 20157 | 2.5% |
| Connector Punctuation | 17780 | 2.2% |
| Math Symbol | 37 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 45941 | |
| I | 35464 | |
| M | 34420 | |
| A | 34162 | |
| R | 18907 | 5.8% |
| S | 18652 | 5.7% |
| O | 17093 | 5.2% |
| C | 16710 | 5.1% |
| L | 16139 | 4.9% |
| U | 13291 | 4.1% |
| Other values (16) | 75338 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 7987 | |
| e | 4721 | |
| r | 3987 | |
| a | 3742 | |
| n | 2355 | 6.9% |
| i | 1842 | 5.4% |
| m | 1834 | 5.3% |
| t | 1781 | 5.2% |
| v | 1564 | 4.6% |
| l | 1062 | 3.1% |
| Other values (14) | 3485 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 78674 | |
| 1 | 62347 | |
| 2 | 44017 | |
| 3 | 27081 | 8.3% |
| 8 | 21679 | 6.7% |
| 6 | 20194 | 6.2% |
| 4 | 19711 | 6.1% |
| 7 | 19228 | 5.9% |
| 5 | 17540 | 5.4% |
| 9 | 15076 | 4.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 21523 | |
| ; | 4626 | 17.7% |
| . | 18 | 0.1% |
| # | 14 | 0.1% |
| : | 12 | < 0.1% |
| , | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 70165 |
Space Separator
| Value | Count | Frequency (%) |
| 20157 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 17780 |
Math Symbol
| Value | Count | Frequency (%) |
| > | 37 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 459884 | |
| Latin | 360477 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| B | 45941 | |
| I | 35464 | 9.8% |
| M | 34420 | 9.5% |
| A | 34162 | 9.5% |
| R | 18907 | 5.2% |
| S | 18652 | 5.2% |
| O | 17093 | 4.7% |
| C | 16710 | 4.6% |
| L | 16139 | 4.5% |
| U | 13291 | 3.7% |
| Other values (40) | 109698 |
Common
| Value | Count | Frequency (%) |
| 0 | 78674 | |
| - | 70165 | |
| 1 | 62347 | |
| 2 | 44017 | |
| 3 | 27081 | 5.9% |
| 8 | 21679 | 4.7% |
| / | 21523 | 4.7% |
| 6 | 20194 | 4.4% |
| 20157 | 4.4% | |
| 4 | 19711 | 4.3% |
| Other values (12) | 74336 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 820361 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 78674 | 9.6% |
| - | 70165 | 8.6% |
| 1 | 62347 | 7.6% |
| B | 45941 | 5.6% |
| 2 | 44017 | 5.4% |
| I | 35464 | 4.3% |
| M | 34420 | 4.2% |
| A | 34162 | 4.2% |
| 3 | 27081 | 3.3% |
| 8 | 21679 | 2.6% |
| Other values (62) | 366411 |
eventDate
Text
Missing 
| Distinct | 23060 |
|---|---|
| Distinct (%) | 7.2% |
| Missing | 16369 |
| Missing (%) | 4.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 10 |
| Mean length | 11.08453105 |
| Min length | 4 |
Unique
| Unique | 1370 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 1977-05-21 |
|---|---|
| 2nd row | 2003-04-05 |
| 3rd row | 2009-12-05 |
| 4th row | 2006-09-14 |
| 5th row | 2003-05-01/2003-05-13 |
| Value | Count | Frequency (%) |
| 2018-03-19/2018-03-23 | 1120 | 0.3% |
| 2016-02-22/2016-03-09 | 842 | 0.3% |
| 2008-06-11 | 649 | 0.2% |
| 2017-05-26 | 623 | 0.2% |
| 2015-05-09 | 524 | 0.2% |
| 2017-05-23 | 519 | 0.2% |
| 2017-05-30 | 515 | 0.2% |
| 2006-03-12 | 513 | 0.2% |
| 2017-08-14 | 508 | 0.2% |
| 2017-05-27 | 505 | 0.2% |
| Other values (23050) | 315807 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 772965 | |
| - | 704820 | |
| 1 | 577630 | |
| 2 | 414060 | |
| 9 | 302792 | 8.5% |
| 8 | 151695 | 4.2% |
| 7 | 140417 | 3.9% |
| 6 | 131389 | 3.7% |
| 5 | 123971 | 3.5% |
| 3 | 120245 | 3.4% |
| Other values (14) | 130022 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2831662 | |
| Dash Punctuation | 704820 | 19.7% |
| Other Punctuation | 33409 | 0.9% |
| Space Separator | 54 | < 0.1% |
| Lowercase Letter | 52 | < 0.1% |
| Uppercase Letter | 7 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 772965 | |
| 1 | 577630 | |
| 2 | 414060 | |
| 9 | 302792 | 10.7% |
| 8 | 151695 | 5.4% |
| 7 | 140417 | 5.0% |
| 6 | 131389 | 4.6% |
| 5 | 123971 | 4.4% |
| 3 | 120245 | 4.2% |
| 4 | 96498 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 2 | |
| S | 2 | |
| W | 1 | |
| E | 1 | |
| P | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 33225 | |
| , | 183 | 0.5% |
| : | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 26 | |
| r | 26 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 704820 |
Space Separator
| Value | Count | Frequency (%) |
| 54 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3569947 | |
| Latin | 59 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 772965 | |
| - | 704820 | |
| 1 | 577630 | |
| 2 | 414060 | |
| 9 | 302792 | 8.5% |
| 8 | 151695 | 4.2% |
| 7 | 140417 | 3.9% |
| 6 | 131389 | 3.7% |
| 5 | 123971 | 3.5% |
| 3 | 120245 | 3.4% |
| Other values (7) | 129963 | 3.6% |
Latin
| Value | Count | Frequency (%) |
| o | 26 | |
| r | 26 | |
| G | 2 | 3.4% |
| S | 2 | 3.4% |
| W | 1 | 1.7% |
| E | 1 | 1.7% |
| P | 1 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3570006 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 772965 | |
| - | 704820 | |
| 1 | 577630 | |
| 2 | 414060 | |
| 9 | 302792 | 8.5% |
| 8 | 151695 | 4.2% |
| 7 | 140417 | 3.9% |
| 6 | 131389 | 3.7% |
| 5 | 123971 | 3.5% |
| 3 | 120245 | 3.4% |
| Other values (14) | 130022 | 3.6% |
eventTime
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338439 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 94648 |
|---|
| Value | Count | Frequency (%) |
| 94648 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4 | 2 | |
| 9 | 1 | |
| 6 | 1 | |
| 8 | 1 |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 18131 |
| Missing (%) | 5.4% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.771973313 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 141 |
|---|---|
| 2nd row | 95 |
| 3rd row | 339 |
| 4th row | 257 |
| 5th row | 121 |
| Value | Count | Frequency (%) |
| 142 | 2414 | 0.8% |
| 78 | 1966 | 0.6% |
| 140 | 1912 | 0.6% |
| 147 | 1847 | 0.6% |
| 201 | 1845 | 0.6% |
| 152 | 1832 | 0.6% |
| 197 | 1819 | 0.6% |
| 182 | 1814 | 0.6% |
| 150 | 1806 | 0.6% |
| 146 | 1793 | 0.6% |
| Other values (356) | 301261 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 193442 | |
| 2 | 157736 | |
| 3 | 100838 | |
| 4 | 66349 | 7.5% |
| 5 | 65465 | 7.4% |
| 7 | 63652 | 7.2% |
| 0 | 61103 | 6.9% |
| 6 | 61082 | 6.9% |
| 8 | 59915 | 6.7% |
| 9 | 58306 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 887888 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 193442 | |
| 2 | 157736 | |
| 3 | 100838 | |
| 4 | 66349 | 7.5% |
| 5 | 65465 | 7.4% |
| 7 | 63652 | 7.2% |
| 0 | 61103 | 6.9% |
| 6 | 61082 | 6.9% |
| 8 | 59915 | 6.7% |
| 9 | 58306 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 887888 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 193442 | |
| 2 | 157736 | |
| 3 | 100838 | |
| 4 | 66349 | 7.5% |
| 5 | 65465 | 7.4% |
| 7 | 63652 | 7.2% |
| 0 | 61103 | 6.9% |
| 6 | 61082 | 6.9% |
| 8 | 59915 | 6.7% |
| 9 | 58306 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 887888 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 193442 | |
| 2 | 157736 | |
| 3 | 100838 | |
| 4 | 66349 | 7.5% |
| 5 | 65465 | 7.4% |
| 7 | 63652 | 7.2% |
| 0 | 61103 | 6.9% |
| 6 | 61082 | 6.9% |
| 8 | 59915 | 6.7% |
| 9 | 58306 | 6.6% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 17911 |
| Missing (%) | 5.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.778768848 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 141 |
|---|---|
| 2nd row | 95 |
| 3rd row | 339 |
| 4th row | 257 |
| 5th row | 133 |
| Value | Count | Frequency (%) |
| 142 | 2346 | 0.7% |
| 151 | 2066 | 0.6% |
| 150 | 2017 | 0.6% |
| 82 | 1898 | 0.6% |
| 212 | 1891 | 0.6% |
| 143 | 1865 | 0.6% |
| 69 | 1862 | 0.6% |
| 197 | 1800 | 0.6% |
| 146 | 1794 | 0.6% |
| 147 | 1756 | 0.5% |
| Other values (356) | 301234 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 189931 | |
| 2 | 159764 | |
| 3 | 101857 | |
| 4 | 67311 | 7.6% |
| 5 | 65248 | 7.3% |
| 0 | 63136 | 7.1% |
| 6 | 62126 | 7.0% |
| 7 | 61959 | 7.0% |
| 8 | 59962 | 6.7% |
| 9 | 59382 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 890676 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 189931 | |
| 2 | 159764 | |
| 3 | 101857 | |
| 4 | 67311 | 7.6% |
| 5 | 65248 | 7.3% |
| 0 | 63136 | 7.1% |
| 6 | 62126 | 7.0% |
| 7 | 61959 | 7.0% |
| 8 | 59962 | 6.7% |
| 9 | 59382 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 890676 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 189931 | |
| 2 | 159764 | |
| 3 | 101857 | |
| 4 | 67311 | 7.6% |
| 5 | 65248 | 7.3% |
| 0 | 63136 | 7.1% |
| 6 | 62126 | 7.0% |
| 7 | 61959 | 7.0% |
| 8 | 59962 | 6.7% |
| 9 | 59382 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 890676 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 189931 | |
| 2 | 159764 | |
| 3 | 101857 | |
| 4 | 67311 | 7.6% |
| 5 | 65248 | 7.3% |
| 0 | 63136 | 7.1% |
| 6 | 62126 | 7.0% |
| 7 | 61959 | 7.0% |
| 8 | 59962 | 6.7% |
| 9 | 59382 | 6.7% |
year
Text
Missing 
| Distinct | 158 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 16370 |
| Missing (%) | 4.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1977 |
|---|---|
| 2nd row | 2003 |
| 3rd row | 2009 |
| 4th row | 2006 |
| 5th row | 2003 |
| Value | Count | Frequency (%) |
| 2009 | 14264 | 4.4% |
| 2017 | 14067 | 4.4% |
| 2015 | 13813 | 4.3% |
| 2010 | 13737 | 4.3% |
| 2012 | 12220 | 3.8% |
| 2008 | 11987 | 3.7% |
| 2016 | 11451 | 3.6% |
| 2018 | 11086 | 3.4% |
| 2019 | 9910 | 3.1% |
| 2006 | 9459 | 2.9% |
| Other values (148) | 200076 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 296276 | |
| 1 | 274919 | |
| 2 | 223079 | |
| 9 | 214556 | |
| 8 | 70592 | 5.5% |
| 7 | 60236 | 4.7% |
| 6 | 50357 | 3.9% |
| 5 | 38161 | 3.0% |
| 3 | 30589 | 2.4% |
| 4 | 29515 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1288280 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 296276 | |
| 1 | 274919 | |
| 2 | 223079 | |
| 9 | 214556 | |
| 8 | 70592 | 5.5% |
| 7 | 60236 | 4.7% |
| 6 | 50357 | 3.9% |
| 5 | 38161 | 3.0% |
| 3 | 30589 | 2.4% |
| 4 | 29515 | 2.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1288280 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 296276 | |
| 1 | 274919 | |
| 2 | 223079 | |
| 9 | 214556 | |
| 8 | 70592 | 5.5% |
| 7 | 60236 | 4.7% |
| 6 | 50357 | 3.9% |
| 5 | 38161 | 3.0% |
| 3 | 30589 | 2.4% |
| 4 | 29515 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1288280 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 296276 | |
| 1 | 274919 | |
| 2 | 223079 | |
| 9 | 214556 | |
| 8 | 70592 | 5.5% |
| 7 | 60236 | 4.7% |
| 6 | 50357 | 3.9% |
| 5 | 38161 | 3.0% |
| 3 | 30589 | 2.4% |
| 4 | 29515 | 2.3% |
month
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 17966 |
| Missing (%) | 5.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 1 |
| Mean length | 1.178235988 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 4 |
| 3rd row | 12 |
| 4th row | 9 |
| 5th row | 5 |
| Value | Count | Frequency (%) |
| 5 | 42822 | |
| 6 | 37403 | |
| 7 | 36987 | |
| 8 | 30887 | |
| 4 | 28913 | |
| 3 | 27542 | |
| 9 | 25613 | |
| 10 | 23427 | |
| 11 | 20374 | |
| 2 | 16876 | 5.3% |
| Other values (6) | 29632 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 93804 | |
| 5 | 42823 | |
| 6 | 37405 | 9.9% |
| 7 | 36988 | 9.8% |
| 8 | 30889 | 8.2% |
| 2 | 30178 | 8.0% |
| 4 | 28913 | 7.7% |
| 3 | 27542 | 7.3% |
| 9 | 25615 | 6.8% |
| 0 | 23432 | 6.2% |
| Other values (3) | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 377589 | |
| Other Punctuation | 2 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 93804 | |
| 5 | 42823 | |
| 6 | 37405 | 9.9% |
| 7 | 36988 | 9.8% |
| 8 | 30889 | 8.2% |
| 2 | 30178 | 8.0% |
| 4 | 28913 | 7.7% |
| 3 | 27542 | 7.3% |
| 9 | 25615 | 6.8% |
| 0 | 23432 | 6.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 377593 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 93804 | |
| 5 | 42823 | |
| 6 | 37405 | 9.9% |
| 7 | 36988 | 9.8% |
| 8 | 30889 | 8.2% |
| 2 | 30178 | 8.0% |
| 4 | 28913 | 7.7% |
| 3 | 27542 | 7.3% |
| 9 | 25615 | 6.8% |
| 0 | 23432 | 6.2% |
| Other values (2) | 4 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 377594 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 93804 | |
| 5 | 42823 | |
| 6 | 37405 | 9.9% |
| 7 | 36988 | 9.8% |
| 8 | 30889 | 8.2% |
| 2 | 30178 | 8.0% |
| 4 | 28913 | 7.7% |
| 3 | 27542 | 7.3% |
| 9 | 25615 | 6.8% |
| 0 | 23432 | 6.2% |
| Other values (3) | 5 | < 0.1% |
day
Text
Missing 
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 19384 |
| Missing (%) | 5.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 2 |
| Mean length | 1.689853819 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 21 |
|---|---|
| 2nd row | 5 |
| 3rd row | 5 |
| 4th row | 14 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 18050 | 5.7% |
| 15 | 11503 | 3.6% |
| 11 | 11303 | 3.5% |
| 12 | 11300 | 3.5% |
| 5 | 11212 | 3.5% |
| 22 | 11187 | 3.5% |
| 16 | 11185 | 3.5% |
| 10 | 11174 | 3.5% |
| 8 | 10958 | 3.4% |
| 19 | 10620 | 3.3% |
| Other values (25) | 200566 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 150942 | |
| 2 | 130384 | |
| 3 | 43011 | 8.0% |
| 8 | 32046 | 5.9% |
| 5 | 31740 | 5.9% |
| 6 | 30889 | 5.7% |
| 9 | 30427 | 5.6% |
| 0 | 30285 | 5.6% |
| 7 | 30076 | 5.6% |
| 4 | 29353 | 5.4% |
| Other values (3) | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 539153 | |
| Other Punctuation | 2 | < 0.1% |
| Space Separator | 2 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 150942 | |
| 2 | 130384 | |
| 3 | 43011 | 8.0% |
| 8 | 32046 | 5.9% |
| 5 | 31740 | 5.9% |
| 6 | 30889 | 5.7% |
| 9 | 30427 | 5.6% |
| 0 | 30285 | 5.6% |
| 7 | 30076 | 5.6% |
| 4 | 29353 | 5.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 539157 | |
| Latin | 1 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 150942 | |
| 2 | 130384 | |
| 3 | 43011 | 8.0% |
| 8 | 32046 | 5.9% |
| 5 | 31740 | 5.9% |
| 6 | 30889 | 5.7% |
| 9 | 30427 | 5.6% |
| 0 | 30285 | 5.6% |
| 7 | 30076 | 5.6% |
| 4 | 29353 | 5.4% |
| Other values (2) | 4 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| E | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 539158 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 150942 | |
| 2 | 130384 | |
| 3 | 43011 | 8.0% |
| 8 | 32046 | 5.9% |
| 5 | 31740 | 5.9% |
| 6 | 30889 | 5.7% |
| 9 | 30427 | 5.6% |
| 0 | 30285 | 5.6% |
| 7 | 30076 | 5.6% |
| 4 | 29353 | 5.4% |
| Other values (3) | 5 | < 0.1% |
Missing 
| Distinct | 10232 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 236098 |
| Missing (%) | 69.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 72 |
|---|---|
| Median length | 71 |
| Mean length | 13.69964433 |
| Min length | 1 |
Unique
| Unique | 2665 ? |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | 4/5/2003 3:59:00 PM |
|---|---|
| 2nd row | 2007 or prior, based on filename of source data sheet |
| 3rd row | 14 Sep 2006 |
| 4th row | 10/11/2002 1:30:00 PM |
| 5th row | 11 May 2014 |
| Value | Count | Frequency (%) |
| may | 10963 | 3.6% |
| apr | 6723 | 2.2% |
| pm | 6654 | 2.2% |
| aug | 5888 | 1.9% |
| 5378 | 1.8% | |
| 2007 | 5232 | 1.7% |
| sep | 5187 | 1.7% |
| mar | 4910 | 1.6% |
| 2008 | 4661 | 1.5% |
| june | 4032 | 1.3% |
| Other values (3776) | 243090 |
Most occurring characters
| Value | Count | Frequency (%) |
| 200376 | 14.3% | |
| 0 | 159129 | 11.3% |
| 1 | 143433 | 10.2% |
| 2 | 117809 | 8.4% |
| 9 | 73246 | 5.2% |
| e | 40969 | 2.9% |
| 8 | 37322 | 2.7% |
| a | 35837 | 2.6% |
| 3 | 32888 | 2.3% |
| r | 32308 | 2.3% |
| Other values (66) | 528732 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 671471 | |
| Lowercase Letter | 314014 | |
| Space Separator | 200376 | 14.3% |
| Uppercase Letter | 112938 | 8.1% |
| Other Punctuation | 64411 | 4.6% |
| Dash Punctuation | 30846 | 2.2% |
| Open Punctuation | 3949 | 0.3% |
| Close Punctuation | 3949 | 0.3% |
| Math Symbol | 81 | < 0.1% |
| Connector Punctuation | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 40969 | |
| a | 35837 | |
| r | 32308 | |
| u | 27482 | |
| t | 25389 | 8.1% |
| p | 19868 | 6.3% |
| n | 18276 | 5.8% |
| y | 16752 | 5.3% |
| o | 15217 | 4.8% |
| c | 13242 | 4.2% |
| Other values (15) | 68674 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 25479 | |
| J | 20887 | |
| A | 19556 | |
| S | 12309 | |
| N | 7624 | 6.8% |
| P | 6958 | 6.2% |
| D | 5028 | 4.5% |
| O | 4899 | 4.3% |
| F | 4345 | 3.8% |
| E | 1480 | 1.3% |
| Other values (11) | 4373 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 32233 | |
| / | 13156 | |
| . | 8673 | 13.5% |
| ; | 8159 | 12.7% |
| , | 2132 | 3.3% |
| ? | 16 | < 0.1% |
| * | 15 | < 0.1% |
| ' | 9 | < 0.1% |
| & | 6 | < 0.1% |
| # | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 159129 | |
| 1 | 143433 | |
| 2 | 117809 | |
| 9 | 73246 | |
| 8 | 37322 | 5.6% |
| 3 | 32888 | 4.9% |
| 5 | 31315 | 4.7% |
| 7 | 28026 | 4.2% |
| 4 | 24321 | 3.6% |
| 6 | 23982 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30565 | |
| – | 281 | 0.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 3931 | |
| ( | 18 | 0.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 3931 | |
| ) | 18 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 200376 |
Math Symbol
| Value | Count | Frequency (%) |
| | | 81 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 14 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 975097 | |
| Latin | 426952 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 40969 | 9.6% |
| a | 35837 | 8.4% |
| r | 32308 | 7.6% |
| u | 27482 | 6.4% |
| M | 25479 | 6.0% |
| t | 25389 | 5.9% |
| J | 20887 | 4.9% |
| p | 19868 | 4.7% |
| A | 19556 | 4.6% |
| n | 18276 | 4.3% |
| Other values (36) | 160901 |
Common
| Value | Count | Frequency (%) |
| 200376 | ||
| 0 | 159129 | |
| 1 | 143433 | |
| 2 | 117809 | |
| 9 | 73246 | 7.5% |
| 8 | 37322 | 3.8% |
| 3 | 32888 | 3.4% |
| : | 32233 | 3.3% |
| 5 | 31315 | 3.2% |
| - | 30565 | 3.1% |
| Other values (20) | 116781 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1401768 | |
| Punctuation | 281 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 200376 | 14.3% | |
| 0 | 159129 | 11.4% |
| 1 | 143433 | 10.2% |
| 2 | 117809 | 8.4% |
| 9 | 73246 | 5.2% |
| e | 40969 | 2.9% |
| 8 | 37322 | 2.7% |
| a | 35837 | 2.6% |
| 3 | 32888 | 2.3% |
| r | 32308 | 2.3% |
| Other values (65) | 528451 |
Punctuation
| Value | Count | Frequency (%) |
| – | 281 |
habitat
Text
Missing 
| Distinct | 5075 |
|---|---|
| Distinct (%) | 14.1% |
| Missing | 302334 |
| Missing (%) | 89.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 382 |
|---|---|
| Median length | 180 |
| Mean length | 39.97399324 |
| Min length | 1 |
Unique
| Unique | 1916 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | Rocky slope with scattered shrubs. Moist soil on slope |
|---|---|
| 2nd row | Scrubland |
| 3rd row | Ecological remarks by collector(s): yes |
| 4th row | Cultivated/garden |
| 5th row | brushed from under rubble |
| Value | Count | Frequency (%) |
| forest | 9242 | 4.6% |
| and | 8086 | 4.0% |
| with | 6443 | 3.2% |
| by | 4854 | 2.4% |
| ecological | 4350 | 2.2% |
| remarks | 4350 | 2.2% |
| collector(s | 4345 | 2.1% |
| in | 4302 | 2.1% |
| yes | 3549 | 1.8% |
| slopes | 2423 | 1.2% |
| Other values (4259) | 150202 |
Most occurring characters
| Value | Count | Frequency (%) |
| 166040 | 11.5% | |
| e | 123018 | 8.5% |
| a | 115153 | 8.0% |
| r | 97879 | 6.8% |
| o | 97117 | 6.7% |
| s | 87783 | 6.1% |
| i | 77223 | 5.4% |
| n | 74016 | 5.1% |
| t | 69226 | 4.8% |
| l | 65379 | 4.5% |
| Other values (77) | 470467 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1159779 | |
| Space Separator | 166040 | 11.5% |
| Uppercase Letter | 57713 | 4.0% |
| Other Punctuation | 44027 | 3.1% |
| Open Punctuation | 5088 | 0.4% |
| Close Punctuation | 5084 | 0.4% |
| Decimal Number | 3129 | 0.2% |
| Dash Punctuation | 2289 | 0.2% |
| Math Symbol | 151 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 123018 | |
| a | 115153 | 9.9% |
| r | 97879 | 8.4% |
| o | 97117 | 8.4% |
| s | 87783 | 7.6% |
| i | 77223 | 6.7% |
| n | 74016 | 6.4% |
| t | 69226 | 6.0% |
| l | 65379 | 5.6% |
| c | 53034 | 4.6% |
| Other values (17) | 299951 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 6028 | 10.4% |
| C | 5586 | 9.7% |
| S | 5345 | 9.3% |
| A | 5268 | 9.1% |
| P | 4287 | 7.4% |
| M | 4226 | 7.3% |
| R | 4148 | 7.2% |
| B | 3104 | 5.4% |
| D | 2370 | 4.1% |
| G | 2065 | 3.6% |
| Other values (16) | 15286 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 22821 | |
| . | 11654 | |
| : | 4682 | 10.6% |
| / | 2873 | 6.5% |
| ; | 1485 | 3.4% |
| & | 182 | 0.4% |
| % | 121 | 0.3% |
| " | 101 | 0.2% |
| ? | 72 | 0.2% |
| ' | 24 | 0.1% |
| Other values (2) | 12 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 715 | |
| 1 | 509 | |
| 2 | 394 | |
| 5 | 351 | |
| 3 | 259 | 8.3% |
| 8 | 219 | 7.0% |
| 4 | 197 | 6.3% |
| 6 | 173 | 5.5% |
| 7 | 162 | 5.2% |
| 9 | 150 | 4.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2277 | |
| — | 8 | 0.3% |
| – | 4 | 0.2% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 138 | |
| + | 8 | 5.3% |
| < | 5 | 3.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5052 | |
| [ | 36 | 0.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5048 | |
| ] | 36 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 166040 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1217492 | |
| Common | 225809 | 15.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 123018 | 10.1% |
| a | 115153 | 9.5% |
| r | 97879 | 8.0% |
| o | 97117 | 8.0% |
| s | 87783 | 7.2% |
| i | 77223 | 6.3% |
| n | 74016 | 6.1% |
| t | 69226 | 5.7% |
| l | 65379 | 5.4% |
| c | 53034 | 4.4% |
| Other values (43) | 357664 |
Common
| Value | Count | Frequency (%) |
| 166040 | ||
| , | 22821 | 10.1% |
| . | 11654 | 5.2% |
| ( | 5052 | 2.2% |
| ) | 5048 | 2.2% |
| : | 4682 | 2.1% |
| / | 2873 | 1.3% |
| - | 2277 | 1.0% |
| ; | 1485 | 0.7% |
| 0 | 715 | 0.3% |
| Other values (24) | 3162 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1443283 | |
| Punctuation | 12 | < 0.1% |
| None | 6 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 166040 | 11.5% | |
| e | 123018 | 8.5% |
| a | 115153 | 8.0% |
| r | 97879 | 6.8% |
| o | 97117 | 6.7% |
| s | 87783 | 6.1% |
| i | 77223 | 5.4% |
| n | 74016 | 5.1% |
| t | 69226 | 4.8% |
| l | 65379 | 4.5% |
| Other values (74) | 470449 |
Punctuation
| Value | Count | Frequency (%) |
| — | 8 | |
| – | 4 |
None
| Value | Count | Frequency (%) |
| ñ | 6 |
eventRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338439 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 96 |
|---|---|
| Median length | 96 |
| Mean length | 96 |
| Min length | 96 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Guide to Best Practices for Georeferencing. (Chapman and Wieczorek, eds. 2006). Google Earth Pro |
|---|
| Value | Count | Frequency (%) |
| guide | 1 | 7.1% |
| to | 1 | 7.1% |
| best | 1 | 7.1% |
| practices | 1 | 7.1% |
| for | 1 | 7.1% |
| georeferencing | 1 | 7.1% |
| chapman | 1 | 7.1% |
| and | 1 | 7.1% |
| wieczorek | 1 | 7.1% |
| eds | 1 | 7.1% |
| Other values (4) | 4 |
Most occurring characters
| Value | Count | Frequency (%) |
| 13 | 13.5% | |
| e | 11 | 11.5% |
| o | 7 | 7.3% |
| r | 7 | 7.3% |
| a | 5 | 5.2% |
| n | 4 | 4.2% |
| i | 4 | 4.2% |
| t | 4 | 4.2% |
| c | 4 | 4.2% |
| G | 3 | 3.1% |
| Other values (23) | 34 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 64 | |
| Space Separator | 13 | 13.5% |
| Uppercase Letter | 9 | 9.4% |
| Other Punctuation | 4 | 4.2% |
| Decimal Number | 4 | 4.2% |
| Close Punctuation | 1 | 1.0% |
| Open Punctuation | 1 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 7 | |
| r | 7 | |
| a | 5 | |
| n | 4 | 6.2% |
| i | 4 | 6.2% |
| t | 4 | 6.2% |
| c | 4 | 6.2% |
| d | 3 | 4.7% |
| s | 3 | 4.7% |
| Other values (9) | 12 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 3 | |
| P | 2 | |
| B | 1 | 11.1% |
| W | 1 | 11.1% |
| C | 1 | 11.1% |
| E | 1 | 11.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 6 | 1 | |
| 2 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3 | |
| , | 1 | 25.0% |
Space Separator
| Value | Count | Frequency (%) |
| 13 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 73 | |
| Common | 23 | 24.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11 | |
| o | 7 | 9.6% |
| r | 7 | 9.6% |
| a | 5 | 6.8% |
| n | 4 | 5.5% |
| i | 4 | 5.5% |
| t | 4 | 5.5% |
| c | 4 | 5.5% |
| G | 3 | 4.1% |
| d | 3 | 4.1% |
| Other values (15) | 21 |
Common
| Value | Count | Frequency (%) |
| 13 | ||
| . | 3 | 13.0% |
| 0 | 2 | 8.7% |
| , | 1 | 4.3% |
| ) | 1 | 4.3% |
| 6 | 1 | 4.3% |
| 2 | 1 | 4.3% |
| ( | 1 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 96 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 13 | 13.5% | |
| e | 11 | 11.5% |
| o | 7 | 7.3% |
| r | 7 | 7.3% |
| a | 5 | 5.2% |
| n | 4 | 4.2% |
| i | 4 | 4.2% |
| t | 4 | 4.2% |
| c | 4 | 4.2% |
| G | 3 | 3.1% |
| Other values (23) | 34 |
locationID
Text
Missing 
| Distinct | 4571 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 284922 |
| Missing (%) | 84.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 6.812642475 |
| Min length | 1 |
Unique
| Unique | 1199 ? |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | T548 |
|---|---|
| 2nd row | BIZ-231 |
| 3rd row | T488 |
| 4th row | 02-10 |
| 5th row | VES117 |
| Value | Count | Frequency (%) |
| 080611_minv_014 | 627 | 1.1% |
| site | 469 | 0.8% |
| i | 457 | 0.8% |
| trawl | 456 | 0.8% |
| serc | 326 | 0.6% |
| 14 | 313 | 0.6% |
| v1951 | 309 | 0.5% |
| 080608_minv_012 | 289 | 0.5% |
| 21 | 276 | 0.5% |
| 10 | 275 | 0.5% |
| Other values (4452) | 53080 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 37162 | 10.2% |
| 1 | 34715 | 9.5% |
| - | 19207 | 5.3% |
| 2 | 18298 | 5.0% |
| I | 15967 | 4.4% |
| _ | 15386 | 4.2% |
| 5 | 13787 | 3.8% |
| 4 | 13700 | 3.8% |
| 8 | 13275 | 3.6% |
| 6 | 12677 | 3.5% |
| Other values (73) | 170425 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 176632 | |
| Uppercase Letter | 123892 | |
| Lowercase Letter | 23399 | 6.4% |
| Dash Punctuation | 19207 | 5.3% |
| Connector Punctuation | 15386 | 4.2% |
| Space Separator | 3359 | 0.9% |
| Other Punctuation | 2274 | 0.6% |
| Open Punctuation | 203 | 0.1% |
| Close Punctuation | 202 | 0.1% |
| Math Symbol | 40 | < 0.1% |
| Other values (2) | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 15967 | |
| A | 12422 | 10.0% |
| B | 12233 | 9.9% |
| S | 9801 | 7.9% |
| M | 9595 | 7.7% |
| T | 7531 | 6.1% |
| Z | 6274 | 5.1% |
| O | 5823 | 4.7% |
| N | 5746 | 4.6% |
| V | 4404 | 3.6% |
| Other values (18) | 34096 |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 2393 | |
| i | 2331 | |
| e | 1964 | 8.4% |
| m | 1947 | 8.3% |
| o | 1871 | 8.0% |
| a | 1852 | 7.9% |
| t | 1743 | 7.4% |
| r | 1556 | 6.6% |
| v | 1293 | 5.5% |
| g | 941 | 4.0% |
| Other values (17) | 5508 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 37162 | |
| 1 | 34715 | |
| 2 | 18298 | |
| 5 | 13787 | 7.8% |
| 4 | 13700 | 7.8% |
| 8 | 13275 | 7.5% |
| 6 | 12677 | 7.2% |
| 3 | 12433 | 7.0% |
| 7 | 11387 | 6.4% |
| 9 | 9198 | 5.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1119 | |
| . | 1044 | |
| # | 109 | 4.8% |
| , | 2 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 196 | |
| [ | 6 | 3.0% |
| ‚ | 1 | 0.5% |
Math Symbol
| Value | Count | Frequency (%) |
| > | 37 | |
| ¬ | 2 | 5.0% |
| + | 1 | 2.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 196 | |
| ] | 6 | 3.0% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 3 | |
| € | 1 | 25.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19207 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 15386 |
Space Separator
| Value | Count | Frequency (%) |
| 3359 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 217308 | |
| Latin | 147291 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| I | 15967 | 10.8% |
| A | 12422 | 8.4% |
| B | 12233 | 8.3% |
| S | 9801 | 6.7% |
| M | 9595 | 6.5% |
| T | 7531 | 5.1% |
| Z | 6274 | 4.3% |
| O | 5823 | 4.0% |
| N | 5746 | 3.9% |
| V | 4404 | 3.0% |
| Other values (45) | 57495 |
Common
| Value | Count | Frequency (%) |
| 0 | 37162 | |
| 1 | 34715 | |
| - | 19207 | |
| 2 | 18298 | |
| _ | 15386 | |
| 5 | 13787 | 6.3% |
| 4 | 13700 | 6.3% |
| 8 | 13275 | 6.1% |
| 6 | 12677 | 5.8% |
| 3 | 12433 | 5.7% |
| Other values (18) | 26668 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 364581 | |
| None | 15 | < 0.1% |
| Punctuation | 2 | < 0.1% |
| Currency Symbols | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 37162 | 10.2% |
| 1 | 34715 | 9.5% |
| - | 19207 | 5.3% |
| 2 | 18298 | 5.0% |
| I | 15967 | 4.4% |
| _ | 15386 | 4.2% |
| 5 | 13787 | 3.8% |
| 4 | 13700 | 3.8% |
| 8 | 13275 | 3.6% |
| 6 | 12677 | 3.5% |
| Other values (62) | 170407 |
None
| Value | Count | Frequency (%) |
| Ã | 3 | |
| ¢ | 3 | |
| Â | 2 | |
| â | 2 | |
| ¬ | 2 | |
| ƒ | 1 | 6.7% |
| š | 1 | 6.7% |
| Å | 1 | 6.7% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 1 |
Punctuation
| Value | Count | Frequency (%) |
| ‚ | 1 | |
| “ | 1 |
higherGeography
Text
Missing 
| Distinct | 7780 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 4534 |
| Missing (%) | 1.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 128 |
|---|---|
| Median length | 103 |
| Mean length | 44.48238426 |
| Min length | 3 |
Unique
| Unique | 788 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | United States, Arizona, Cochise |
|---|---|
| 2nd row | North Pacific Ocean, Gulf of California, Mexico |
| 3rd row | South Pacific Ocean, French Polynesia, Society Islands, Moorea |
| 4th row | United States, Arkansas |
| 5th row | Asia-Temperate, China, Xizang, Nielamu (Nyalam) Xian |
| Value | Count | Frequency (%) |
| states | 150874 | 7.6% |
| united | 150796 | 7.6% |
| north | 101914 | 5.1% |
| ocean | 69469 | 3.5% |
| pacific | 66315 | 3.4% |
| america | 65503 | 3.3% |
| stated | 60374 | 3.0% |
| not | 60374 | 3.0% |
| islands | 44123 | 2.2% |
| atlantic | 41421 | 2.1% |
| Other values (4526) | 1168373 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1645630 | 11.1% | |
| a | 1473689 | 9.9% |
| t | 1109174 | 7.5% |
| e | 1085474 | 7.3% |
| i | 1041776 | 7.0% |
| n | 861843 | 5.8% |
| , | 826533 | 5.6% |
| o | 731933 | 4.9% |
| r | 620649 | 4.2% |
| s | 542408 | 3.7% |
| Other values (88) | 4913826 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10257330 | |
| Uppercase Letter | 1954366 | 13.2% |
| Space Separator | 1645630 | 11.1% |
| Other Punctuation | 837556 | 5.6% |
| Close Punctuation | 63081 | 0.4% |
| Open Punctuation | 63081 | 0.4% |
| Dash Punctuation | 30906 | 0.2% |
| Modifier Letter | 813 | < 0.1% |
| Decimal Number | 169 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1473689 | |
| t | 1109174 | |
| e | 1085474 | |
| i | 1041776 | |
| n | 861843 | |
| o | 731933 | 7.1% |
| r | 620649 | 6.1% |
| s | 542408 | 5.3% |
| c | 501353 | 4.9% |
| l | 396979 | 3.9% |
| Other values (36) | 1892052 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 344712 | |
| N | 206484 | |
| A | 201040 | |
| C | 181503 | |
| U | 160381 | |
| P | 159171 | |
| M | 93276 | 4.8% |
| O | 87982 | 4.5% |
| B | 72896 | 3.7% |
| I | 68366 | 3.5% |
| Other values (20) | 378555 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 826533 | |
| . | 7811 | 0.9% |
| ' | 2815 | 0.3% |
| ? | 201 | < 0.1% |
| / | 190 | < 0.1% |
| * | 5 | < 0.1% |
| ; | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 108 | |
| 1 | 24 | 14.2% |
| 2 | 16 | 9.5% |
| 9 | 13 | 7.7% |
| 0 | 8 | 4.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30890 | |
| – | 10 | < 0.1% |
| — | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 61158 | |
| ) | 1923 | 3.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 61158 | |
| ( | 1923 | 3.0% |
Space Separator
| Value | Count | Frequency (%) |
| 1645630 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 813 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12211696 | |
| Common | 2641239 | 17.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1473689 | 12.1% |
| t | 1109174 | 9.1% |
| e | 1085474 | 8.9% |
| i | 1041776 | 8.5% |
| n | 861843 | 7.1% |
| o | 731933 | 6.0% |
| r | 620649 | 5.1% |
| s | 542408 | 4.4% |
| c | 501353 | 4.1% |
| l | 396979 | 3.3% |
| Other values (66) | 3846418 |
Common
| Value | Count | Frequency (%) |
| 1645630 | ||
| , | 826533 | |
| ] | 61158 | 2.3% |
| [ | 61158 | 2.3% |
| - | 30890 | 1.2% |
| . | 7811 | 0.3% |
| ' | 2815 | 0.1% |
| ) | 1923 | 0.1% |
| ( | 1923 | 0.1% |
| ʻ | 813 | < 0.1% |
| Other values (12) | 585 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14839267 | |
| None | 12839 | 0.1% |
| Modifier Letters | 813 | < 0.1% |
| Punctuation | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1645630 | 11.1% | |
| a | 1473689 | 9.9% |
| t | 1109174 | 7.5% |
| e | 1085474 | 7.3% |
| i | 1041776 | 7.0% |
| n | 861843 | 5.8% |
| , | 826533 | 5.6% |
| o | 731933 | 4.9% |
| r | 620649 | 4.2% |
| s | 542408 | 3.7% |
| Other values (61) | 4900158 |
None
| Value | Count | Frequency (%) |
| é | 3479 | |
| í | 2115 | |
| ã | 1908 | |
| Î | 1381 | 10.8% |
| ó | 1026 | 8.0% |
| ā | 813 | 6.3% |
| ç | 805 | 6.3% |
| á | 431 | 3.4% |
| ä | 239 | 1.9% |
| ö | 194 | 1.5% |
| Other values (14) | 448 | 3.5% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 813 |
Punctuation
| Value | Count | Frequency (%) |
| – | 10 | |
| — | 6 |
continent
Text
Missing 
| Distinct | 65 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 144951 |
| Missing (%) | 42.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 50 |
|---|---|
| Median length | 46 |
| Mean length | 15.26976727 |
| Min length | 4 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Pacific Ocean |
|---|---|
| 2nd row | South Pacific Ocean |
| 3rd row | Asia-Temperate |
| 4th row | North Atlantic Ocean |
| 5th row | Pacific |
| Value | Count | Frequency (%) |
| north | 93980 | |
| ocean | 69217 | |
| pacific | 66248 | |
| america | 65503 | |
| atlantic | 41364 | |
| south | 31475 | 7.1% |
| 13897 | 3.1% | |
| neotropics | 13896 | 3.1% |
| asia | 9238 | 2.1% |
| africa | 8531 | 1.9% |
| Other values (18) | 28368 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 336991 | |
| a | 291503 | 9.9% |
| i | 287727 | 9.7% |
| 248228 | 8.4% | |
| t | 235357 | 8.0% |
| r | 194360 | 6.6% |
| e | 173294 | 5.9% |
| o | 157410 | 5.3% |
| n | 131417 | 4.4% |
| A | 131273 | 4.4% |
| Other values (25) | 766972 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2240751 | |
| Uppercase Letter | 433353 | 14.7% |
| Space Separator | 248228 | 8.4% |
| Dash Punctuation | 19474 | 0.7% |
| Other Punctuation | 12726 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 336991 | |
| a | 291503 | |
| i | 287727 | |
| t | 235357 | |
| r | 194360 | |
| e | 173294 | |
| o | 157410 | |
| n | 131417 | 5.9% |
| h | 125782 | 5.6% |
| f | 74779 | 3.3% |
| Other values (9) | 232131 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 131273 | |
| N | 107876 | |
| O | 72337 | |
| P | 66248 | |
| S | 31802 | 7.3% |
| I | 8809 | 2.0% |
| T | 5570 | 1.3% |
| C | 5068 | 1.2% |
| W | 2692 | 0.6% |
| L | 635 | 0.1% |
| Other values (2) | 1043 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 12725 | |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 248228 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 19474 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2674104 | |
| Common | 280428 | 9.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 336991 | |
| a | 291503 | |
| i | 287727 | |
| t | 235357 | 8.8% |
| r | 194360 | 7.3% |
| e | 173294 | 6.5% |
| o | 157410 | 5.9% |
| n | 131417 | 4.9% |
| A | 131273 | 4.9% |
| h | 125782 | 4.7% |
| Other values (21) | 608990 |
Common
| Value | Count | Frequency (%) |
| 248228 | ||
| - | 19474 | 6.9% |
| , | 12725 | 4.5% |
| ? | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2954532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 336991 | |
| a | 291503 | 9.9% |
| i | 287727 | 9.7% |
| 248228 | 8.4% | |
| t | 235357 | 8.0% |
| r | 194360 | 6.6% |
| e | 173294 | 5.9% |
| o | 157410 | 5.3% |
| n | 131417 | 4.4% |
| A | 131273 | 4.4% |
| Other values (25) | 766972 |
waterBody
Text
Missing 
| Distinct | 218 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 231595 |
| Missing (%) | 68.4% |
| Memory size | 2.6 MiB |
Length
| Max length | 101 |
|---|---|
| Median length | 55 |
| Mean length | 20.41994478 |
| Min length | 6 |
Unique
| Unique | 18 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North Pacific Ocean, Gulf of California |
|---|---|
| 2nd row | South Pacific Ocean |
| 3rd row | North Atlantic Ocean |
| 4th row | Pacific |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 69218 | |
| pacific | 61625 | |
| north | 47128 | |
| atlantic | 41364 | |
| south | 18416 | 5.6% |
| sea | 18259 | 5.6% |
| caribbean | 14747 | 4.5% |
| bay | 12126 | 3.7% |
| gulf | 7272 | 2.2% |
| of | 6755 | 2.1% |
| Other values (211) | 31235 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 263749 | |
| c | 237824 | |
| 221300 | 10.1% | |
| i | 195326 | 9.0% |
| t | 153144 | 7.0% |
| n | 144281 | 6.6% |
| e | 133436 | 6.1% |
| o | 88007 | 4.0% |
| f | 78821 | 3.6% |
| h | 75935 | 3.5% |
| Other values (49) | 589946 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1599886 | |
| Uppercase Letter | 321575 | 14.7% |
| Space Separator | 221300 | 10.1% |
| Other Punctuation | 38190 | 1.8% |
| Modifier Letter | 813 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 263749 | |
| c | 237824 | |
| i | 195326 | |
| t | 153144 | |
| n | 144281 | |
| e | 133436 | |
| o | 88007 | 5.5% |
| f | 78821 | 4.9% |
| h | 75935 | 4.7% |
| r | 70399 | 4.4% |
| Other values (16) | 158964 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 69719 | |
| P | 64694 | |
| N | 47138 | |
| A | 42035 | |
| S | 38955 | |
| C | 22310 | 6.9% |
| B | 14241 | 4.4% |
| G | 7328 | 2.3% |
| K | 4835 | 1.5% |
| M | 3990 | 1.2% |
| Other values (14) | 6330 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 38110 | |
| ' | 73 | 0.2% |
| . | 6 | < 0.1% |
| ; | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 221300 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 813 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1921461 | |
| Common | 260308 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 263749 | |
| c | 237824 | |
| i | 195326 | |
| t | 153144 | 8.0% |
| n | 144281 | 7.5% |
| e | 133436 | 6.9% |
| o | 88007 | 4.6% |
| f | 78821 | 4.1% |
| h | 75935 | 4.0% |
| r | 70399 | 3.7% |
| Other values (40) | 480539 |
Common
| Value | Count | Frequency (%) |
| 221300 | ||
| , | 38110 | 14.6% |
| ʻ | 813 | 0.3% |
| ' | 73 | < 0.1% |
| . | 6 | < 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
| ; | 1 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2180143 | |
| None | 813 | < 0.1% |
| Modifier Letters | 813 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 263749 | |
| c | 237824 | |
| 221300 | 10.2% | |
| i | 195326 | 9.0% |
| t | 153144 | 7.0% |
| n | 144281 | 6.6% |
| e | 133436 | 6.1% |
| o | 88007 | 4.0% |
| f | 78821 | 3.6% |
| h | 75935 | 3.5% |
| Other values (47) | 588320 |
None
| Value | Count | Frequency (%) |
| ā | 813 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 813 |
islandGroup
Text
Missing 
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 315692 |
| Missing (%) | 93.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 21 |
| Mean length | 14.50536311 |
| Min length | 5 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Society Islands |
|---|---|
| 2nd row | Leeward Antilles |
| 3rd row | Bahama Islands |
| 4th row | Society Islands |
| 5th row | Visayas |
| Value | Count | Frequency (%) |
| islands | 15126 | |
| society | 10385 | |
| leeward | 3586 | 7.6% |
| antilles | 3195 | 6.7% |
| îles | 1364 | 2.9% |
| vent | 1364 | 2.9% |
| du | 1303 | 2.7% |
| cays | 1105 | 2.3% |
| bahama | 991 | 2.1% |
| group | 828 | 1.7% |
| Other values (103) | 8219 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 39254 | |
| a | 28836 | 8.7% |
| e | 28070 | 8.5% |
| 24718 | 7.5% | |
| l | 24500 | 7.4% |
| n | 22729 | 6.9% |
| d | 21963 | 6.7% |
| i | 17641 | 5.3% |
| t | 16063 | 4.9% |
| I | 15514 | 4.7% |
| Other values (41) | 90680 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 257173 | |
| Uppercase Letter | 47716 | 14.5% |
| Space Separator | 24718 | 7.5% |
| Other Punctuation | 361 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 39254 | |
| a | 28836 | |
| e | 28070 | |
| l | 24500 | |
| n | 22729 | |
| d | 21963 | |
| i | 17641 | |
| t | 16063 | |
| o | 12693 | 4.9% |
| y | 11946 | 4.6% |
| Other values (15) | 33478 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 15514 | |
| S | 11295 | |
| L | 4429 | 9.3% |
| A | 4273 | 9.0% |
| V | 2164 | 4.5% |
| B | 2064 | 4.3% |
| C | 2048 | 4.3% |
| Î | 1364 | 2.9% |
| P | 926 | 1.9% |
| G | 917 | 1.9% |
| Other values (13) | 2722 | 5.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 353 | |
| ' | 8 | 2.2% |
Space Separator
| Value | Count | Frequency (%) |
| 24718 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 304889 | |
| Common | 25079 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 39254 | |
| a | 28836 | 9.5% |
| e | 28070 | 9.2% |
| l | 24500 | 8.0% |
| n | 22729 | 7.5% |
| d | 21963 | 7.2% |
| i | 17641 | 5.8% |
| t | 16063 | 5.3% |
| I | 15514 | 5.1% |
| o | 12693 | 4.2% |
| Other values (38) | 77626 |
Common
| Value | Count | Frequency (%) |
| 24718 | ||
| . | 353 | 1.4% |
| ' | 8 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 328604 | |
| None | 1364 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 39254 | |
| a | 28836 | 8.8% |
| e | 28070 | 8.5% |
| 24718 | 7.5% | |
| l | 24500 | 7.5% |
| n | 22729 | 6.9% |
| d | 21963 | 6.7% |
| i | 17641 | 5.4% |
| t | 16063 | 4.9% |
| I | 15514 | 4.7% |
| Other values (40) | 89316 |
None
| Value | Count | Frequency (%) |
| Î | 1364 |
island
Text
Missing 
| Distinct | 566 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 279541 |
| Missing (%) | 82.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 25 |
| Mean length | 8.431569297 |
| Min length | 3 |
Unique
| Unique | 36 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Moorea |
|---|---|
| 2nd row | Moorea |
| 3rd row | Mindanao |
| 4th row | Klein Curacao |
| 5th row | Moorea |
| Value | Count | Frequency (%) |
| moorea | 15960 | |
| cay | 7347 | 8.5% |
| carrie | 4788 | 5.5% |
| bow | 4788 | 5.5% |
| island | 4068 | 4.7% |
| curacao | 3681 | 4.3% |
| oahu | 2250 | 2.6% |
| luzon | 2090 | 2.4% |
| borneo | 2047 | 2.4% |
| atoll | 915 | 1.1% |
| Other values (560) | 38504 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 78616 | |
| o | 63718 | |
| r | 44941 | 9.0% |
| e | 38317 | 7.7% |
| 27539 | 5.5% | |
| u | 21082 | 4.2% |
| n | 20978 | 4.2% |
| i | 20856 | 4.2% |
| C | 19943 | 4.0% |
| M | 19709 | 4.0% |
| Other values (52) | 140912 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 380361 | |
| Uppercase Letter | 86167 | 17.4% |
| Space Separator | 27539 | 5.5% |
| Close Punctuation | 802 | 0.2% |
| Open Punctuation | 802 | 0.2% |
| Other Punctuation | 782 | 0.2% |
| Dash Punctuation | 158 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 78616 | |
| o | 63718 | |
| r | 44941 | |
| e | 38317 | |
| u | 21082 | 5.5% |
| n | 20978 | 5.5% |
| i | 20856 | 5.5% |
| l | 12017 | 3.2% |
| s | 11399 | 3.0% |
| y | 10671 | 2.8% |
| Other values (19) | 57766 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 19943 | |
| M | 19709 | |
| B | 8939 | |
| I | 4682 | 5.4% |
| T | 4468 | 5.2% |
| S | 3549 | 4.1% |
| L | 3261 | 3.8% |
| H | 2578 | 3.0% |
| O | 2571 | 3.0% |
| P | 2470 | 2.9% |
| Other values (16) | 13997 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 707 | |
| . | 73 | 9.3% |
| , | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 27539 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 802 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 802 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 158 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 466528 | |
| Common | 30083 | 6.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 78616 | |
| o | 63718 | |
| r | 44941 | 9.6% |
| e | 38317 | 8.2% |
| u | 21082 | 4.5% |
| n | 20978 | 4.5% |
| i | 20856 | 4.5% |
| C | 19943 | 4.3% |
| M | 19709 | 4.2% |
| l | 12017 | 2.6% |
| Other values (45) | 126351 |
Common
| Value | Count | Frequency (%) |
| 27539 | ||
| ] | 802 | 2.7% |
| [ | 802 | 2.7% |
| ' | 707 | 2.4% |
| - | 158 | 0.5% |
| . | 73 | 0.2% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 496161 | |
| None | 450 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 78616 | |
| o | 63718 | |
| r | 44941 | 9.1% |
| e | 38317 | 7.7% |
| 27539 | 5.6% | |
| u | 21082 | 4.2% |
| n | 20978 | 4.2% |
| i | 20856 | 4.2% |
| C | 19943 | 4.0% |
| M | 19709 | 4.0% |
| Other values (47) | 140462 |
None
| Value | Count | Frequency (%) |
| ç | 380 | |
| ó | 34 | 7.6% |
| ò | 19 | 4.2% |
| Î | 14 | 3.1% |
| Ž | 3 | 0.7% |
country
Text
Missing 
| Distinct | 244 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 14430 |
| Missing (%) | 4.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 36 |
| Mean length | 11.00677448 |
| Min length | 4 |
Unique
| Unique | 8 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Mexico |
| 3rd row | French Polynesia |
| 4th row | United States |
| 5th row | China |
| Value | Count | Frequency (%) |
| states | 150853 | |
| united | 150769 | |
| french | 23145 | 4.3% |
| polynesia | 22963 | 4.3% |
| mexico | 9713 | 1.8% |
| panama | 9216 | 1.7% |
| belize | 9195 | 1.7% |
| philippines | 6781 | 1.3% |
| guyana | 5999 | 1.1% |
| new | 5306 | 1.0% |
| Other values (266) | 142214 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 480575 | |
| e | 435115 | |
| a | 385179 | |
| i | 291078 | 8.2% |
| n | 286223 | 8.0% |
| 212144 | 5.9% | |
| s | 206350 | 5.8% |
| d | 173347 | 4.9% |
| S | 162516 | 4.6% |
| U | 152569 | 4.3% |
| Other values (56) | 781209 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2819834 | |
| Uppercase Letter | 531078 | 14.9% |
| Space Separator | 212144 | 5.9% |
| Other Punctuation | 2066 | 0.1% |
| Dash Punctuation | 595 | < 0.1% |
| Close Punctuation | 294 | < 0.1% |
| Open Punctuation | 294 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 480575 | |
| e | 435115 | |
| a | 385179 | |
| i | 291078 | |
| n | 286223 | |
| s | 206350 | |
| d | 173347 | 6.1% |
| o | 77020 | 2.7% |
| r | 74598 | 2.6% |
| l | 73008 | 2.6% |
| Other values (20) | 337341 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 162516 | |
| U | 152569 | |
| P | 49504 | 9.3% |
| F | 26660 | 5.0% |
| C | 22198 | 4.2% |
| B | 21697 | 4.1% |
| M | 21674 | 4.1% |
| G | 14861 | 2.8% |
| A | 8850 | 1.7% |
| T | 8420 | 1.6% |
| Other values (15) | 42129 | 7.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1670 | |
| . | 280 | 13.6% |
| ' | 65 | 3.1% |
| ? | 50 | 2.4% |
| / | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 240 | |
| ] | 54 | 18.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 240 | |
| [ | 54 | 18.4% |
Space Separator
| Value | Count | Frequency (%) |
| 212144 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 595 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3350912 | |
| Common | 215393 | 6.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 480575 | |
| e | 435115 | |
| a | 385179 | |
| i | 291078 | |
| n | 286223 | |
| s | 206350 | 6.2% |
| d | 173347 | 5.2% |
| S | 162516 | 4.8% |
| U | 152569 | 4.6% |
| o | 77020 | 2.3% |
| Other values (45) | 700940 |
Common
| Value | Count | Frequency (%) |
| 212144 | ||
| , | 1670 | 0.8% |
| - | 595 | 0.3% |
| . | 280 | 0.1% |
| ) | 240 | 0.1% |
| ( | 240 | 0.1% |
| ' | 65 | < 0.1% |
| [ | 54 | < 0.1% |
| ] | 54 | < 0.1% |
| ? | 50 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3563223 | |
| None | 3082 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 480575 | |
| e | 435115 | |
| a | 385179 | |
| i | 291078 | 8.2% |
| n | 286223 | 8.0% |
| 212144 | 6.0% | |
| s | 206350 | 5.8% |
| d | 173347 | 4.9% |
| S | 162516 | 4.6% |
| U | 152569 | 4.3% |
| Other values (52) | 778127 |
None
| Value | Count | Frequency (%) |
| é | 912 | |
| í | 885 | |
| ã | 885 | |
| ç | 400 |
stateProvince
Text
Missing 
| Distinct | 1646 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 66214 |
| Missing (%) | 19.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 52 |
|---|---|
| Median length | 42 |
| Mean length | 9.616215938 |
| Min length | 3 |
Unique
| Unique | 68 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arizona |
|---|---|
| 2nd row | Arkansas |
| 3rd row | Xizang |
| 4th row | Laikipia |
| 5th row | Florida |
| Value | Count | Frequency (%) |
| california | 17069 | 4.6% |
| florida | 16485 | 4.4% |
| texas | 14332 | 3.9% |
| virginia | 13045 | 3.5% |
| not | 10639 | 2.9% |
| stated | 10639 | 2.9% |
| arizona | 9691 | 2.6% |
| carolina | 8854 | 2.4% |
| region | 8372 | 2.3% |
| new | 8072 | 2.2% |
| Other values (1667) | 253746 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 361759 | |
| i | 257226 | 9.8% |
| n | 192597 | 7.4% |
| o | 191176 | 7.3% |
| r | 175302 | 6.7% |
| e | 143640 | 5.5% |
| s | 116938 | 4.5% |
| t | 109100 | 4.2% |
| l | 104596 | 4.0% |
| 98718 | 3.8% | |
| Other values (72) | 866732 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2114743 | |
| Uppercase Letter | 371148 | 14.2% |
| Space Separator | 98718 | 3.8% |
| Open Punctuation | 10915 | 0.4% |
| Close Punctuation | 10915 | 0.4% |
| Dash Punctuation | 8617 | 0.3% |
| Other Punctuation | 2607 | 0.1% |
| Decimal Number | 121 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 361759 | |
| i | 257226 | |
| n | 192597 | |
| o | 191176 | |
| r | 175302 | |
| e | 143640 | 6.8% |
| s | 116938 | 5.5% |
| t | 109100 | 5.2% |
| l | 104596 | 4.9% |
| u | 68939 | 3.3% |
| Other values (31) | 393470 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 47952 | |
| T | 35494 | 9.6% |
| S | 34273 | 9.2% |
| N | 33681 | 9.1% |
| M | 32065 | 8.6% |
| A | 24621 | 6.6% |
| F | 18726 | 5.0% |
| V | 16298 | 4.4% |
| P | 16190 | 4.4% |
| I | 12017 | 3.2% |
| Other values (18) | 99831 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2199 | |
| ' | 211 | 8.1% |
| / | 93 | 3.6% |
| , | 61 | 2.3% |
| ? | 43 | 1.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 10615 | |
| ( | 300 | 2.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 10615 | |
| ) | 300 | 2.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 108 | |
| 9 | 13 | 10.7% |
Space Separator
| Value | Count | Frequency (%) |
| 98718 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8617 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2485891 | |
| Common | 131893 | 5.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 361759 | |
| i | 257226 | 10.3% |
| n | 192597 | 7.7% |
| o | 191176 | 7.7% |
| r | 175302 | 7.1% |
| e | 143640 | 5.8% |
| s | 116938 | 4.7% |
| t | 109100 | 4.4% |
| l | 104596 | 4.2% |
| u | 68939 | 2.8% |
| Other values (59) | 764618 |
Common
| Value | Count | Frequency (%) |
| 98718 | ||
| [ | 10615 | 8.0% |
| ] | 10615 | 8.0% |
| - | 8617 | 6.5% |
| . | 2199 | 1.7% |
| ( | 300 | 0.2% |
| ) | 300 | 0.2% |
| ' | 211 | 0.2% |
| 3 | 108 | 0.1% |
| / | 93 | 0.1% |
| Other values (3) | 117 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2611543 | |
| None | 6241 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 361759 | |
| i | 257226 | 9.8% |
| n | 192597 | 7.4% |
| o | 191176 | 7.3% |
| r | 175302 | 6.7% |
| e | 143640 | 5.5% |
| s | 116938 | 4.5% |
| t | 109100 | 4.2% |
| l | 104596 | 4.0% |
| 98718 | 3.8% | |
| Other values (55) | 860491 |
None
| Value | Count | Frequency (%) |
| é | 2427 | |
| ã | 978 | |
| ó | 951 | 15.2% |
| í | 870 | 13.9% |
| á | 390 | 6.2% |
| ä | 239 | 3.8% |
| ö | 185 | 3.0% |
| ñ | 88 | 1.4% |
| ô | 45 | 0.7% |
| ü | 17 | 0.3% |
| Other values (7) | 51 | 0.8% |
county
Text
Missing 
| Distinct | 3053 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 140615 |
| Missing (%) | 41.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 56 |
|---|---|
| Median length | 35 |
| Mean length | 10.83488437 |
| Min length | 1 |
Unique
| Unique | 295 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Cochise |
|---|---|
| 2nd row | Nielamu (Nyalam) Xian |
| 3rd row | [Not Stated] |
| 4th row | [Not Stated] |
| 5th row | [Not Stated] |
| Value | Count | Frequency (%) |
| not | 49678 | 15.0% |
| stated | 49678 | 15.0% |
| county | 38512 | 11.6% |
| honolulu | 5036 | 1.5% |
| san | 4616 | 1.4% |
| st | 3591 | 1.1% |
| cochise | 3342 | 1.0% |
| lucie | 3228 | 1.0% |
| island | 2684 | 0.8% |
| xian | 2352 | 0.7% |
| Other values (2542) | 169101 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 236941 | 11.1% |
| o | 188038 | 8.8% |
| a | 183246 | 8.5% |
| e | 150194 | 7.0% |
| n | 138867 | 6.5% |
| 133993 | 6.3% | |
| u | 85315 | 4.0% |
| i | 84625 | 3.9% |
| d | 76684 | 3.6% |
| r | 76363 | 3.6% |
| Other values (73) | 789145 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1568693 | |
| Uppercase Letter | 330147 | 15.4% |
| Space Separator | 133993 | 6.3% |
| Open Punctuation | 50909 | 2.4% |
| Close Punctuation | 50909 | 2.4% |
| Other Punctuation | 6653 | 0.3% |
| Dash Punctuation | 2056 | 0.1% |
| Decimal Number | 48 | < 0.1% |
| Math Symbol | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 236941 | |
| o | 188038 | |
| a | 183246 | |
| e | 150194 | |
| n | 138867 | |
| u | 85315 | 5.4% |
| i | 84625 | 5.4% |
| d | 76684 | 4.9% |
| r | 76363 | 4.9% |
| l | 60301 | 3.8% |
| Other values (28) | 288119 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 73008 | |
| C | 59889 | |
| N | 54961 | |
| H | 14159 | 4.3% |
| B | 13887 | 4.2% |
| M | 13874 | 4.2% |
| P | 13324 | 4.0% |
| L | 12997 | 3.9% |
| A | 12173 | 3.7% |
| D | 9876 | 3.0% |
| Other values (18) | 51999 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4074 | |
| ' | 1745 | |
| , | 626 | 9.4% |
| ? | 107 | 1.6% |
| / | 96 | 1.4% |
| * | 5 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 24 | |
| 2 | 16 | |
| 0 | 8 | 16.7% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 49687 | |
| ( | 1222 | 2.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 49687 | |
| ) | 1222 | 2.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2046 | |
| – | 10 | 0.5% |
Space Separator
| Value | Count | Frequency (%) |
| 133993 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1898840 | |
| Common | 244571 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 236941 | 12.5% |
| o | 188038 | 9.9% |
| a | 183246 | 9.7% |
| e | 150194 | 7.9% |
| n | 138867 | 7.3% |
| u | 85315 | 4.5% |
| i | 84625 | 4.5% |
| d | 76684 | 4.0% |
| r | 76363 | 4.0% |
| S | 73008 | 3.8% |
| Other values (56) | 605559 |
Common
| Value | Count | Frequency (%) |
| 133993 | ||
| [ | 49687 | 20.3% |
| ] | 49687 | 20.3% |
| . | 4074 | 1.7% |
| - | 2046 | 0.8% |
| ' | 1745 | 0.7% |
| ) | 1222 | 0.5% |
| ( | 1222 | 0.5% |
| , | 626 | 0.3% |
| ? | 107 | < 0.1% |
| Other values (7) | 162 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2142524 | |
| None | 877 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 236941 | 11.1% |
| o | 188038 | 8.8% |
| a | 183246 | 8.6% |
| e | 150194 | 7.0% |
| n | 138867 | 6.5% |
| 133993 | 6.3% | |
| u | 85315 | 4.0% |
| i | 84625 | 3.9% |
| d | 76684 | 3.6% |
| r | 76363 | 3.6% |
| Other values (58) | 788258 |
None
| Value | Count | Frequency (%) |
| í | 360 | |
| ü | 153 | |
| é | 137 | 15.6% |
| ã | 45 | 5.1% |
| á | 41 | 4.7% |
| ó | 38 | 4.3% |
| â | 32 | 3.6% |
| ç | 25 | 2.9% |
| ô | 15 | 1.7% |
| ö | 9 | 1.0% |
| Other values (4) | 22 | 2.5% |
Punctuation
| Value | Count | Frequency (%) |
| – | 10 |
locality
Text
Missing 
| Distinct | 31947 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 34082 |
| Missing (%) | 10.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 312 |
|---|---|
| Median length | 249 |
| Mean length | 40.81947246 |
| Min length | 3 |
Unique
| Unique | 4476 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Carr Canyon, Huachuca Mountains |
|---|---|
| 2nd row | Society Islands, Moorea, In front of Hilton |
| 3rd row | Ashdown |
| 4th row | Nielamu Zhen. Route 318 between Zhangmu and Nielamu (Nyalam) ca. 8 km from Zhangmu. |
| 5th row | Mpala Research Centre |
| Value | Count | Frequency (%) |
| of | 95516 | 4.7% |
| km | 27914 | 1.4% |
| road | 26009 | 1.3% |
| on | 20789 | 1.0% |
| island | 19636 | 1.0% |
| and | 19477 | 1.0% |
| national | 18168 | 0.9% |
| river | 17531 | 0.9% |
| creek | 15261 | 0.8% |
| at | 14878 | 0.7% |
| Other values (27243) | 1757365 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1728186 | 13.9% | |
| a | 1103311 | 8.9% |
| e | 889287 | 7.2% |
| o | 819763 | 6.6% |
| n | 662428 | 5.3% |
| i | 647781 | 5.2% |
| r | 607866 | 4.9% |
| t | 592552 | 4.8% |
| l | 449386 | 3.6% |
| s | 433787 | 3.5% |
| Other values (123) | 4489386 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8496745 | |
| Space Separator | 1728186 | 13.9% |
| Uppercase Letter | 1412552 | 11.4% |
| Other Punctuation | 436235 | 3.5% |
| Decimal Number | 259007 | 2.1% |
| Close Punctuation | 32228 | 0.3% |
| Open Punctuation | 32214 | 0.3% |
| Dash Punctuation | 20771 | 0.2% |
| Other Symbol | 2900 | < 0.1% |
| Math Symbol | 1877 | < 0.1% |
| Other values (7) | 1018 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1103311 | |
| e | 889287 | |
| o | 819763 | |
| n | 662428 | 7.8% |
| i | 647781 | 7.6% |
| r | 607866 | 7.2% |
| t | 592552 | 7.0% |
| l | 449386 | 5.3% |
| s | 433787 | 5.1% |
| u | 304184 | 3.6% |
| Other values (44) | 1986400 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 160294 | 11.3% |
| C | 145463 | 10.3% |
| M | 103816 | 7.3% |
| B | 101782 | 7.2% |
| R | 99908 | 7.1% |
| P | 98660 | 7.0% |
| N | 89546 | 6.3% |
| I | 60545 | 4.3% |
| A | 59405 | 4.2% |
| L | 54638 | 3.9% |
| Other values (24) | 438495 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 297955 | |
| . | 103321 | 23.7% |
| ' | 10613 | 2.4% |
| ; | 7920 | 1.8% |
| " | 4321 | 1.0% |
| : | 4150 | 1.0% |
| / | 3514 | 0.8% |
| # | 2998 | 0.7% |
| & | 670 | 0.2% |
| @ | 609 | 0.1% |
| Other values (2) | 164 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 50613 | |
| 2 | 34542 | |
| 0 | 34120 | |
| 5 | 31147 | |
| 3 | 25472 | |
| 4 | 21094 | |
| 6 | 17842 | 6.9% |
| 7 | 16325 | 6.3% |
| 9 | 14534 | 5.6% |
| 8 | 13318 | 5.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1277 | |
| ~ | 431 | 23.0% |
| + | 123 | 6.6% |
| > | 35 | 1.9% |
| < | 8 | 0.4% |
| | | 3 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 26329 | |
| [ | 5884 | 18.3% |
| ‚ | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 26344 | |
| ] | 5884 | 18.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20763 | |
| – | 8 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 2897 | |
| ™ | 3 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 1728186 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 813 |
Other Letter
| Value | Count | Frequency (%) |
| º | 158 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 23 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 10 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 6 |
Other Number
| Value | Count | Frequency (%) |
| ¼ | 5 |
Currency Symbol
| Value | Count | Frequency (%) |
| € | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9909455 | |
| Common | 2514278 | 20.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1103311 | 11.1% |
| e | 889287 | 9.0% |
| o | 819763 | 8.3% |
| n | 662428 | 6.7% |
| i | 647781 | 6.5% |
| r | 607866 | 6.1% |
| t | 592552 | 6.0% |
| l | 449386 | 4.5% |
| s | 433787 | 4.4% |
| u | 304184 | 3.1% |
| Other values (79) | 3399110 |
Common
| Value | Count | Frequency (%) |
| 1728186 | ||
| , | 297955 | 11.9% |
| . | 103321 | 4.1% |
| 1 | 50613 | 2.0% |
| 2 | 34542 | 1.4% |
| 0 | 34120 | 1.4% |
| 5 | 31147 | 1.2% |
| ) | 26344 | 1.0% |
| ( | 26329 | 1.0% |
| 3 | 25472 | 1.0% |
| Other values (34) | 156249 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12412648 | |
| None | 10099 | 0.1% |
| Modifier Letters | 813 | < 0.1% |
| Latin Ext Additional | 142 | < 0.1% |
| Punctuation | 25 | < 0.1% |
| Currency Symbols | 3 | < 0.1% |
| Letterlike Symbols | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1728186 | 13.9% | |
| a | 1103311 | 8.9% |
| e | 889287 | 7.2% |
| o | 819763 | 6.6% |
| n | 662428 | 5.3% |
| i | 647781 | 5.2% |
| r | 607866 | 4.9% |
| t | 592552 | 4.8% |
| l | 449386 | 3.6% |
| s | 433787 | 3.5% |
| Other values (77) | 4478301 |
None
| Value | Count | Frequency (%) |
| ° | 2897 | |
| è | 1904 | |
| é | 1027 | 10.2% |
| í | 1025 | 10.1% |
| ā | 813 | 8.1% |
| á | 677 | 6.7% |
| ó | 376 | 3.7% |
| ô | 224 | 2.2% |
| ã | 207 | 2.0% |
| ñ | 168 | 1.7% |
| Other values (24) | 781 | 7.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 813 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ể | 56 | |
| ạ | 56 | |
| ỏ | 10 | 7.0% |
| ả | 10 | 7.0% |
| ố | 10 | 7.0% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 10 | |
| – | 8 | |
| “ | 6 | |
| ‚ | 1 | 4.0% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 3 |
Letterlike Symbols
| Value | Count | Frequency (%) |
| ™ | 3 |
Missing 
| Distinct | 2610 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 249251 |
| Missing (%) | 73.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.173474307 |
| Min length | 3 |
Unique
| Unique | 256 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 1524.0 |
|---|---|
| 2nd row | 2700.0 |
| 3rd row | 1800.0 |
| 4th row | 1000.0 |
| 5th row | 760.0 |
| Value | Count | Frequency (%) |
| 5.0 | 1674 | 1.9% |
| 1100.0 | 1169 | 1.3% |
| 150.0 | 1080 | 1.2% |
| 200.0 | 1047 | 1.2% |
| 1200.0 | 1002 | 1.1% |
| 50.0 | 851 | 1.0% |
| 10.0 | 831 | 0.9% |
| 1829.0 | 752 | 0.8% |
| 100.0 | 735 | 0.8% |
| 1487.0 | 633 | 0.7% |
| Other values (2597) | 79415 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 142356 | |
| . | 89189 | |
| 1 | 52311 | 11.3% |
| 2 | 35256 | 7.6% |
| 5 | 30778 | 6.7% |
| 3 | 22639 | 4.9% |
| 4 | 21024 | 4.6% |
| 7 | 18460 | 4.0% |
| 6 | 17046 | 3.7% |
| 8 | 16799 | 3.6% |
| Other values (2) | 15559 | 3.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 372216 | |
| Other Punctuation | 89189 | 19.3% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 142356 | |
| 1 | 52311 | 14.1% |
| 2 | 35256 | 9.5% |
| 5 | 30778 | 8.3% |
| 3 | 22639 | 6.1% |
| 4 | 21024 | 5.6% |
| 7 | 18460 | 5.0% |
| 6 | 17046 | 4.6% |
| 8 | 16799 | 4.5% |
| 9 | 15547 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 89189 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 461417 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 142356 | |
| . | 89189 | |
| 1 | 52311 | 11.3% |
| 2 | 35256 | 7.6% |
| 5 | 30778 | 6.7% |
| 3 | 22639 | 4.9% |
| 4 | 21024 | 4.6% |
| 7 | 18460 | 4.0% |
| 6 | 17046 | 3.7% |
| 8 | 16799 | 3.6% |
| Other values (2) | 15559 | 3.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 461417 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 142356 | |
| . | 89189 | |
| 1 | 52311 | 11.3% |
| 2 | 35256 | 7.6% |
| 5 | 30778 | 6.7% |
| 3 | 22639 | 4.9% |
| 4 | 21024 | 4.6% |
| 7 | 18460 | 4.0% |
| 6 | 17046 | 3.7% |
| 8 | 16799 | 3.6% |
| Other values (2) | 15559 | 3.4% |
Missing 
| Distinct | 1577 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 284628 |
| Missing (%) | 84.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.241916301 |
| Min length | 3 |
Unique
| Unique | 152 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 1524.0 |
|---|---|
| 2nd row | 1800.0 |
| 3rd row | 1000.0 |
| 4th row | 760.0 |
| 5th row | 650.0 |
| Value | Count | Frequency (%) |
| 1200.0 | 1121 | 2.1% |
| 1100.0 | 864 | 1.6% |
| 15.0 | 845 | 1.6% |
| 200.0 | 770 | 1.4% |
| 1829.0 | 742 | 1.4% |
| 50.0 | 644 | 1.2% |
| 1487.0 | 633 | 1.2% |
| 800.0 | 616 | 1.1% |
| 1707.0 | 575 | 1.1% |
| 1800.0 | 550 | 1.0% |
| Other values (1564) | 46452 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 91330 | |
| . | 53812 | |
| 1 | 31719 | 11.2% |
| 2 | 20464 | 7.3% |
| 5 | 18027 | 6.4% |
| 4 | 13458 | 4.8% |
| 3 | 12000 | 4.3% |
| 7 | 10831 | 3.8% |
| 8 | 10244 | 3.6% |
| 6 | 10141 | 3.6% |
| Other values (2) | 10052 | 3.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 228254 | |
| Other Punctuation | 53812 | 19.1% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 91330 | |
| 1 | 31719 | 13.9% |
| 2 | 20464 | 9.0% |
| 5 | 18027 | 7.9% |
| 4 | 13458 | 5.9% |
| 3 | 12000 | 5.3% |
| 7 | 10831 | 4.7% |
| 8 | 10244 | 4.5% |
| 6 | 10141 | 4.4% |
| 9 | 10040 | 4.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 53812 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 282078 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 91330 | |
| . | 53812 | |
| 1 | 31719 | 11.2% |
| 2 | 20464 | 7.3% |
| 5 | 18027 | 6.4% |
| 4 | 13458 | 4.8% |
| 3 | 12000 | 4.3% |
| 7 | 10831 | 3.8% |
| 8 | 10244 | 3.6% |
| 6 | 10141 | 3.6% |
| Other values (2) | 10052 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 282078 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 91330 | |
| . | 53812 | |
| 1 | 31719 | 11.2% |
| 2 | 20464 | 7.3% |
| 5 | 18027 | 6.4% |
| 4 | 13458 | 4.8% |
| 3 | 12000 | 4.3% |
| 7 | 10831 | 3.8% |
| 8 | 10244 | 3.6% |
| 6 | 10141 | 3.6% |
| Other values (2) | 10052 | 3.6% |
Missing 
| Distinct | 913 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 322501 |
| Missing (%) | 95.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 27 |
| Mean length | 6.528075789 |
| Min length | 1 |
Unique
| Unique | 164 ? |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | 760 m |
|---|---|
| 2nd row | 1050 ft |
| 3rd row | 611 m |
| 4th row | 73 m |
| 5th row | 500 ft |
| Value | Count | Frequency (%) |
| m | 8076 | |
| ft | 7364 | |
| ca | 906 | 2.6% |
| 503 | 1.5% | |
| 50 | 384 | 1.1% |
| 3440 | 336 | 1.0% |
| sea | 323 | 0.9% |
| level | 323 | 0.9% |
| 54 | 314 | 0.9% |
| 80 | 302 | 0.9% |
| Other values (758) | 15667 |
Most occurring characters
| Value | Count | Frequency (%) |
| 18559 | ||
| 0 | 15811 | |
| m | 8249 | 7.9% |
| t | 7841 | 7.5% |
| f | 7467 | 7.2% |
| 1 | 5481 | 5.3% |
| 5 | 4891 | 4.7% |
| 4 | 4550 | 4.4% |
| 3 | 4549 | 4.4% |
| 2 | 4235 | 4.1% |
| Other values (37) | 22418 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 50353 | |
| Lowercase Letter | 31713 | |
| Space Separator | 18559 | 17.8% |
| Other Punctuation | 1210 | 1.2% |
| Dash Punctuation | 1026 | 1.0% |
| Uppercase Letter | 596 | 0.6% |
| Math Symbol | 364 | 0.3% |
| Open Punctuation | 115 | 0.1% |
| Close Punctuation | 115 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 8249 | |
| t | 7841 | |
| f | 7467 | |
| a | 1699 | 5.4% |
| e | 1622 | 5.1% |
| c | 1235 | 3.9% |
| l | 723 | 2.3% |
| s | 424 | 1.3% |
| r | 422 | 1.3% |
| v | 408 | 1.3% |
| Other values (12) | 1623 | 5.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 15811 | |
| 1 | 5481 | 10.9% |
| 5 | 4891 | 9.7% |
| 4 | 4550 | 9.0% |
| 3 | 4549 | 9.0% |
| 2 | 4235 | 8.4% |
| 6 | 3505 | 7.0% |
| 8 | 3285 | 6.5% |
| 7 | 2223 | 4.4% |
| 9 | 1823 | 3.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1123 | |
| / | 70 | 5.8% |
| ? | 12 | 1.0% |
| ' | 4 | 0.3% |
| , | 1 | 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 197 | |
| P | 195 | |
| G | 195 | |
| L | 9 | 1.5% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 294 | |
| + | 70 | 19.2% |
Space Separator
| Value | Count | Frequency (%) |
| 18559 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1026 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 115 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 115 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 71742 | |
| Latin | 32309 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 8249 | |
| t | 7841 | |
| f | 7467 | |
| a | 1699 | 5.3% |
| e | 1622 | 5.0% |
| c | 1235 | 3.8% |
| l | 723 | 2.2% |
| s | 424 | 1.3% |
| r | 422 | 1.3% |
| v | 408 | 1.3% |
| Other values (16) | 2219 | 6.9% |
Common
| Value | Count | Frequency (%) |
| 18559 | ||
| 0 | 15811 | |
| 1 | 5481 | 7.6% |
| 5 | 4891 | 6.8% |
| 4 | 4550 | 6.3% |
| 3 | 4549 | 6.3% |
| 2 | 4235 | 5.9% |
| 6 | 3505 | 4.9% |
| 8 | 3285 | 4.6% |
| 7 | 2223 | 3.1% |
| Other values (11) | 4653 | 6.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 104051 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 18559 | ||
| 0 | 15811 | |
| m | 8249 | 7.9% |
| t | 7841 | 7.5% |
| f | 7467 | 7.2% |
| 1 | 5481 | 5.3% |
| 5 | 4891 | 4.7% |
| 4 | 4550 | 4.4% |
| 3 | 4549 | 4.4% |
| 2 | 4235 | 4.1% |
| Other values (37) | 22418 |
Missing 
| Distinct | 2030 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 264207 |
| Missing (%) | 78.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 26 |
|---|---|
| Median length | 25 |
| Mean length | 4.129928738 |
| Min length | 3 |
Unique
| Unique | 626 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 1785.34 |
|---|---|
| 2nd row | 13.0 |
| 3rd row | 0.0 |
| 4th row | 25.0 |
| 5th row | 3456.48 |
| Value | Count | Frequency (%) |
| 0.0 | 13845 | 18.6% |
| 1.0 | 5900 | 7.9% |
| 3.0 | 3920 | 5.3% |
| 0.5 | 2538 | 3.4% |
| 2.0 | 2371 | 3.2% |
| 10.0 | 2026 | 2.7% |
| 15.0 | 1882 | 2.5% |
| 1.5 | 1338 | 1.8% |
| 12.0 | 1313 | 1.8% |
| 5.0 | 1309 | 1.8% |
| Other values (2018) | 37795 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 91771 | |
| . | 74229 | |
| 1 | 33738 | 11.0% |
| 2 | 22613 | 7.4% |
| 5 | 20042 | 6.5% |
| 3 | 16696 | 5.4% |
| 6 | 10912 | 3.6% |
| 7 | 9741 | 3.2% |
| 8 | 9557 | 3.1% |
| 9 | 8666 | 2.8% |
| Other values (26) | 8612 | 2.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 232211 | |
| Other Punctuation | 74229 | 24.2% |
| Lowercase Letter | 75 | < 0.1% |
| Dash Punctuation | 54 | < 0.1% |
| Space Separator | 4 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11 | |
| o | 8 | |
| r | 7 | |
| e | 6 | 8.0% |
| l | 6 | 8.0% |
| i | 6 | 8.0% |
| s | 5 | 6.7% |
| h | 4 | 5.3% |
| m | 4 | 5.3% |
| t | 3 | 4.0% |
| Other values (9) | 15 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 91771 | |
| 1 | 33738 | 14.5% |
| 2 | 22613 | 9.7% |
| 5 | 20042 | 8.6% |
| 3 | 16696 | 7.2% |
| 6 | 10912 | 4.7% |
| 7 | 9741 | 4.2% |
| 8 | 9557 | 4.1% |
| 9 | 8666 | 3.7% |
| 4 | 8475 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| C | 1 | |
| O | 1 | |
| M | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 74229 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 54 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 306498 | |
| Latin | 79 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11 | |
| o | 8 | |
| r | 7 | 8.9% |
| e | 6 | 7.6% |
| l | 6 | 7.6% |
| i | 6 | 7.6% |
| s | 5 | 6.3% |
| h | 4 | 5.1% |
| m | 4 | 5.1% |
| t | 3 | 3.8% |
| Other values (13) | 19 |
Common
| Value | Count | Frequency (%) |
| 0 | 91771 | |
| . | 74229 | |
| 1 | 33738 | 11.0% |
| 2 | 22613 | 7.4% |
| 5 | 20042 | 6.5% |
| 3 | 16696 | 5.4% |
| 6 | 10912 | 3.6% |
| 7 | 9741 | 3.2% |
| 8 | 9557 | 3.1% |
| 9 | 8666 | 2.8% |
| Other values (3) | 8533 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 306577 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 91771 | |
| . | 74229 | |
| 1 | 33738 | 11.0% |
| 2 | 22613 | 7.4% |
| 5 | 20042 | 6.5% |
| 3 | 16696 | 5.4% |
| 6 | 10912 | 3.6% |
| 7 | 9741 | 3.2% |
| 8 | 9557 | 3.1% |
| 9 | 8666 | 2.8% |
| Other values (26) | 8612 | 2.8% |
Missing 
| Distinct | 1921 |
|---|---|
| Distinct (%) | 2.9% |
| Missing | 271190 |
| Missing (%) | 80.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 4.119048327 |
| Min length | 3 |
Unique
| Unique | 561 ? |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | 1785.34 |
|---|---|
| 2nd row | 17.0 |
| 3rd row | 98.0 |
| 4th row | 35.0 |
| 5th row | 3456.48 |
| Value | Count | Frequency (%) |
| 1.0 | 6735 | 10.0% |
| 3.0 | 5343 | 7.9% |
| 2.0 | 3528 | 5.2% |
| 5.0 | 2501 | 3.7% |
| 0.5 | 1769 | 2.6% |
| 10.0 | 1694 | 2.5% |
| 1.5 | 1506 | 2.2% |
| 20.0 | 1483 | 2.2% |
| 18.0 | 1428 | 2.1% |
| 12.0 | 1153 | 1.7% |
| Other values (1905) | 40110 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 71150 | |
| . | 67250 | |
| 1 | 34373 | |
| 2 | 21363 | 7.7% |
| 5 | 18352 | 6.6% |
| 3 | 17780 | 6.4% |
| 8 | 10244 | 3.7% |
| 6 | 9269 | 3.3% |
| 9 | 9185 | 3.3% |
| 7 | 9177 | 3.3% |
| Other values (2) | 8863 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 209702 | |
| Other Punctuation | 67250 | 24.3% |
| Dash Punctuation | 54 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 71150 | |
| 1 | 34373 | |
| 2 | 21363 | 10.2% |
| 5 | 18352 | 8.8% |
| 3 | 17780 | 8.5% |
| 8 | 10244 | 4.9% |
| 6 | 9269 | 4.4% |
| 9 | 9185 | 4.4% |
| 7 | 9177 | 4.4% |
| 4 | 8809 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 67250 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 54 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 277006 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 71150 | |
| . | 67250 | |
| 1 | 34373 | |
| 2 | 21363 | 7.7% |
| 5 | 18352 | 6.6% |
| 3 | 17780 | 6.4% |
| 8 | 10244 | 3.7% |
| 6 | 9269 | 3.3% |
| 9 | 9185 | 3.3% |
| 7 | 9177 | 3.3% |
| Other values (2) | 8863 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 277006 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 71150 | |
| . | 67250 | |
| 1 | 34373 | |
| 2 | 21363 | 7.7% |
| 5 | 18352 | 6.6% |
| 3 | 17780 | 6.4% |
| 8 | 10244 | 3.7% |
| 6 | 9269 | 3.3% |
| 9 | 9185 | 3.3% |
| 7 | 9177 | 3.3% |
| Other values (2) | 8863 | 3.2% |
verbatimDepth
Text
Missing 
| Distinct | 59 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 336961 |
| Missing (%) | 99.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 91 |
|---|---|
| Median length | 10 |
| Mean length | 8.625422583 |
| Min length | 2 |
Unique
| Unique | 29 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | to 1 m |
|---|---|
| 2nd row | intertidal |
| 3rd row | <0.5 m |
| 4th row | intertidal |
| 5th row | intertidal |
| Value | Count | Frequency (%) |
| intertidal | 778 | |
| m | 259 | 13.5% |
| surface | 253 | 13.2% |
| to | 103 | 5.4% |
| 1 | 95 | 4.9% |
| 0-1 | 84 | 4.4% |
| intertida | 84 | 4.4% |
| 0.5 | 68 | 3.5% |
| 1m | 47 | 2.4% |
| cm | 13 | 0.7% |
| Other values (55) | 138 | 7.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | 7.0% |
| d | 877 | 6.9% |
| l | 806 | 6.3% |
| 443 | 3.5% | |
| I | 353 | 2.8% |
| Other values (41) | 2672 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10679 | |
| Uppercase Letter | 644 | 5.0% |
| Decimal Number | 596 | 4.7% |
| Space Separator | 443 | 3.5% |
| Math Symbol | 161 | 1.3% |
| Other Punctuation | 108 | 0.8% |
| Dash Punctuation | 102 | 0.8% |
| Open Punctuation | 12 | 0.1% |
| Close Punctuation | 12 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | |
| d | 877 | |
| l | 806 | |
| m | 347 | 3.2% |
| c | 260 | 2.4% |
| Other values (12) | 783 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 252 | |
| 0 | 198 | |
| 5 | 82 | 13.8% |
| 2 | 29 | 4.9% |
| 3 | 12 | 2.0% |
| 4 | 5 | 0.8% |
| 6 | 5 | 0.8% |
| 8 | 5 | 0.8% |
| 9 | 4 | 0.7% |
| 7 | 4 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 353 | |
| S | 258 | |
| M | 12 | 1.9% |
| C | 10 | 1.6% |
| A | 5 | 0.8% |
| U | 4 | 0.6% |
| V | 2 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 81 | |
| : | 14 | 13.0% |
| " | 6 | 5.6% |
| , | 4 | 3.7% |
| ; | 3 | 2.8% |
Math Symbol
| Value | Count | Frequency (%) |
| < | 106 | |
| + | 36 | 22.4% |
| ~ | 19 | 11.8% |
Space Separator
| Value | Count | Frequency (%) |
| 443 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 102 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 12 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11323 | |
| Common | 1434 | 11.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | |
| d | 877 | |
| l | 806 | |
| I | 353 | 3.1% |
| m | 347 | 3.1% |
| Other values (19) | 1334 |
Common
| Value | Count | Frequency (%) |
| 443 | ||
| 1 | 252 | |
| 0 | 198 | |
| < | 106 | 7.4% |
| - | 102 | 7.1% |
| 5 | 82 | 5.7% |
| . | 81 | 5.6% |
| + | 36 | 2.5% |
| 2 | 29 | 2.0% |
| ~ | 19 | 1.3% |
| Other values (12) | 86 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12757 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1871 | |
| i | 1380 | |
| e | 1167 | |
| a | 1150 | |
| r | 1147 | |
| n | 891 | 7.0% |
| d | 877 | 6.9% |
| l | 806 | 6.3% |
| 443 | 3.5% | |
| I | 353 | 2.8% |
| Other values (41) | 2672 |
locationRemarks
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338438 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 30.5 |
| Mean length | 30.5 |
| Min length | 21 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Carpenter, Kent E.; Williams, Jeffrey T. |
|---|---|
| 2nd row | Kirkbride, J. H., Jr. |
| Value | Count | Frequency (%) |
| carpenter | 1 | |
| kent | 1 | |
| e | 1 | |
| williams | 1 | |
| jeffrey | 1 | |
| t | 1 | |
| kirkbride | 1 | |
| j | 1 | |
| h | 1 | |
| jr | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | ||
| r | 6 | 9.8% |
| e | 6 | 9.8% |
| . | 5 | 8.2% |
| i | 4 | 6.6% |
| , | 4 | 6.6% |
| J | 3 | 4.9% |
| l | 2 | 3.3% |
| a | 2 | 3.3% |
| K | 2 | 3.3% |
| Other values (16) | 19 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33 | |
| Other Punctuation | 10 | 16.4% |
| Uppercase Letter | 10 | 16.4% |
| Space Separator | 8 | 13.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 6 | |
| e | 6 | |
| i | 4 | |
| l | 2 | 6.1% |
| a | 2 | 6.1% |
| f | 2 | 6.1% |
| t | 2 | 6.1% |
| n | 2 | 6.1% |
| b | 1 | 3.0% |
| k | 1 | 3.0% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 3 | |
| K | 2 | |
| T | 1 | 10.0% |
| C | 1 | 10.0% |
| W | 1 | 10.0% |
| E | 1 | 10.0% |
| H | 1 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 | |
| , | 4 | |
| ; | 1 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43 | |
| Common | 18 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 6 | |
| e | 6 | |
| i | 4 | 9.3% |
| J | 3 | 7.0% |
| l | 2 | 4.7% |
| a | 2 | 4.7% |
| K | 2 | 4.7% |
| f | 2 | 4.7% |
| t | 2 | 4.7% |
| n | 2 | 4.7% |
| Other values (12) | 12 |
Common
| Value | Count | Frequency (%) |
| 8 | ||
| . | 5 | |
| , | 4 | |
| ; | 1 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 61 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | ||
| r | 6 | 9.8% |
| e | 6 | 9.8% |
| . | 5 | 8.2% |
| i | 4 | 6.6% |
| , | 4 | 6.6% |
| J | 3 | 4.9% |
| l | 2 | 3.3% |
| a | 2 | 3.3% |
| K | 2 | 3.3% |
| Other values (16) | 19 |
decimalLatitude
Text
Missing 
| Distinct | 22664 |
|---|---|
| Distinct (%) | 8.6% |
| Missing | 73885 |
| Missing (%) | 21.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 132 |
|---|---|
| Median length | 7 |
| Mean length | 6.755619814 |
| Min length | 3 |
Unique
| Unique | 3759 ? |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | 31.434 |
|---|---|
| 2nd row | 27.5772 |
| 3rd row | -17.4756 |
| 4th row | 28.0392 |
| 5th row | 0.293 |
| Value | Count | Frequency (%) |
| 12.0832 | 1368 | 0.5% |
| 16.802 | 1085 | 0.4% |
| 22.0 | 898 | 0.3% |
| 31.7306 | 895 | 0.3% |
| 5.0 | 792 | 0.3% |
| 17.4726 | 765 | 0.3% |
| 38.6141 | 727 | 0.3% |
| 34.9606 | 682 | 0.3% |
| 17.4825 | 682 | 0.3% |
| 9.82436 | 665 | 0.3% |
| Other values (22436) | 256022 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 264551 | |
| 3 | 224973 | |
| 1 | 184720 | |
| 2 | 162894 | |
| 7 | 157735 | |
| 4 | 151760 | |
| 8 | 134446 | |
| 5 | 129696 | |
| 6 | 125673 | |
| 9 | 107992 | |
| Other values (34) | 142793 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1469719 | |
| Other Punctuation | 264577 | 14.8% |
| Dash Punctuation | 52585 | 2.9% |
| Lowercase Letter | 296 | < 0.1% |
| Uppercase Letter | 30 | < 0.1% |
| Space Separator | 26 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 37 | |
| a | 37 | |
| e | 33 | |
| t | 28 | |
| r | 26 | |
| o | 23 | 7.8% |
| h | 14 | 4.7% |
| n | 12 | 4.1% |
| p | 12 | 4.1% |
| s | 12 | 4.1% |
| Other values (9) | 62 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 7 | |
| C | 6 | |
| O | 5 | |
| N | 3 | |
| V | 2 | 6.7% |
| L | 2 | 6.7% |
| S | 1 | 3.3% |
| I | 1 | 3.3% |
| E | 1 | 3.3% |
| H | 1 | 3.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 224973 | |
| 1 | 184720 | |
| 2 | 162894 | |
| 7 | 157735 | |
| 4 | 151760 | |
| 8 | 134446 | |
| 5 | 129696 | |
| 6 | 125673 | |
| 9 | 107992 | |
| 0 | 89830 | 6.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 264551 | |
| , | 26 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52585 |
Space Separator
| Value | Count | Frequency (%) |
| 26 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1786907 | |
| Latin | 326 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 37 | |
| a | 37 | |
| e | 33 | 10.1% |
| t | 28 | 8.6% |
| r | 26 | 8.0% |
| o | 23 | 7.1% |
| h | 14 | 4.3% |
| n | 12 | 3.7% |
| p | 12 | 3.7% |
| s | 12 | 3.7% |
| Other values (20) | 92 |
Common
| Value | Count | Frequency (%) |
| . | 264551 | |
| 3 | 224973 | |
| 1 | 184720 | |
| 2 | 162894 | |
| 7 | 157735 | |
| 4 | 151760 | |
| 8 | 134446 | |
| 5 | 129696 | |
| 6 | 125673 | |
| 9 | 107992 | |
| Other values (4) | 142467 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1787233 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 264551 | |
| 3 | 224973 | |
| 1 | 184720 | |
| 2 | 162894 | |
| 7 | 157735 | |
| 4 | 151760 | |
| 8 | 134446 | |
| 5 | 129696 | |
| 6 | 125673 | |
| 9 | 107992 | |
| Other values (34) | 142793 |
decimalLongitude
Text
Missing 
| Distinct | 21527 |
|---|---|
| Distinct (%) | 8.1% |
| Missing | 73885 |
| Missing (%) | 21.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.488620514 |
| Min length | 3 |
Unique
| Unique | 3516 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | -110.285 |
|---|---|
| 2nd row | -111.45 |
| 3rd row | -149.842 |
| 4th row | 85.9858 |
| 5th row | 36.899 |
| Value | Count | Frequency (%) |
| 68.8991 | 1350 | 0.5% |
| 56.1167 | 1222 | 0.5% |
| 149.826 | 1219 | 0.5% |
| 88.082 | 1101 | 0.4% |
| 149.775 | 1056 | 0.4% |
| 110.881 | 913 | 0.3% |
| 88.0817 | 836 | 0.3% |
| 80.2986 | 744 | 0.3% |
| 90.2589 | 732 | 0.3% |
| 176.0 | 682 | 0.3% |
| Other values (21343) | 254700 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 264551 | |
| 1 | 242225 | |
| - | 216857 | |
| 8 | 175296 | |
| 7 | 174521 | |
| 9 | 158734 | |
| 6 | 132600 | |
| 4 | 129719 | |
| 2 | 127821 | |
| 5 | 123226 | |
| Other values (8) | 235602 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1499712 | |
| Other Punctuation | 264551 | 13.4% |
| Dash Punctuation | 216857 | 10.9% |
| Lowercase Letter | 28 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 242225 | |
| 8 | 175296 | |
| 7 | 174521 | |
| 9 | 158734 | |
| 6 | 132600 | |
| 4 | 129719 | |
| 2 | 127821 | |
| 5 | 123226 | |
| 3 | 121814 | |
| 0 | 113756 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 8 | |
| a | 8 | |
| n | 4 | |
| m | 4 | |
| l | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 264551 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 216857 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1981120 | |
| Latin | 32 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 264551 | |
| 1 | 242225 | |
| - | 216857 | |
| 8 | 175296 | |
| 7 | 174521 | |
| 9 | 158734 | |
| 6 | 132600 | |
| 4 | 129719 | |
| 2 | 127821 | |
| 5 | 123226 | |
| Other values (2) | 235570 |
Latin
| Value | Count | Frequency (%) |
| i | 8 | |
| a | 8 | |
| A | 4 | |
| n | 4 | |
| m | 4 | |
| l | 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1981152 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 264551 | |
| 1 | 242225 | |
| - | 216857 | |
| 8 | 175296 | |
| 7 | 174521 | |
| 9 | 158734 | |
| 6 | 132600 | |
| 4 | 129719 | |
| 2 | 127821 | |
| 5 | 123226 | |
| Other values (8) | 235602 |
geodeticDatum
Text
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 308301 |
| Missing (%) | 91.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 18 |
| Mean length | 13.85878762 |
| Min length | 5 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | WGS 84 (EPSG:4326) |
|---|---|
| 2nd row | WGS84 |
| 3rd row | WGS 84 (EPSG:4326) |
| 4th row | WGS 84 (EPSG:4326) |
| 5th row | WGS 84 (EPSG:4326) |
| Value | Count | Frequency (%) |
| wgs | 20207 | |
| 84 | 20207 | |
| epsg:4326 | 19946 | |
| wgs84 | 7743 | 10.9% |
| wgs1984 | 1331 | 1.9% |
| not | 320 | 0.5% |
| recorded | 320 | 0.5% |
| nad27 | 242 | 0.3% |
| nad83 | 230 | 0.3% |
| epsg:4269 | 145 | 0.2% |
| Other values (9) | 176 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| G | 49396 | |
| S | 49372 | |
| 4 | 49372 | |
| 40728 | ||
| 8 | 29587 | 7.1% |
| W | 29281 | 7.0% |
| 2 | 20357 | 4.9% |
| 3 | 20176 | 4.8% |
| ( | 20091 | 4.8% |
| E | 20091 | 4.8% |
| Other values (31) | 89239 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 170341 | |
| Decimal Number | 142780 | |
| Space Separator | 40728 | 9.8% |
| Open Punctuation | 20091 | 4.8% |
| Other Punctuation | 20091 | 4.8% |
| Close Punctuation | 20091 | 4.8% |
| Lowercase Letter | 3568 | 0.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 777 | |
| o | 668 | |
| d | 667 | |
| r | 381 | |
| t | 372 | |
| c | 344 | |
| a | 116 | 3.3% |
| n | 42 | 1.2% |
| l | 38 | 1.1% |
| k | 38 | 1.1% |
| Other values (6) | 125 | 3.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 49396 | |
| S | 49372 | |
| W | 29281 | |
| E | 20091 | |
| P | 20091 | |
| N | 775 | 0.5% |
| D | 496 | 0.3% |
| A | 473 | 0.3% |
| R | 302 | 0.2% |
| C | 40 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 49372 | |
| 8 | 29587 | |
| 2 | 20357 | |
| 3 | 20176 | |
| 6 | 20091 | |
| 9 | 1476 | 1.0% |
| 1 | 1369 | 1.0% |
| 7 | 242 | 0.2% |
| 0 | 72 | 0.1% |
| 5 | 38 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 40728 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 20091 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 20091 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 20091 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 243781 | |
| Latin | 173909 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 49396 | |
| S | 49372 | |
| W | 29281 | |
| E | 20091 | |
| P | 20091 | |
| e | 777 | 0.4% |
| N | 775 | 0.4% |
| o | 668 | 0.4% |
| d | 667 | 0.4% |
| D | 496 | 0.3% |
| Other values (17) | 2295 | 1.3% |
Common
| Value | Count | Frequency (%) |
| 4 | 49372 | |
| 40728 | ||
| 8 | 29587 | |
| 2 | 20357 | |
| 3 | 20176 | |
| ( | 20091 | |
| : | 20091 | |
| 6 | 20091 | |
| ) | 20091 | |
| 9 | 1476 | 0.6% |
| Other values (4) | 1721 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 417690 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 49396 | |
| S | 49372 | |
| 4 | 49372 | |
| 40728 | ||
| 8 | 29587 | 7.1% |
| W | 29281 | 7.0% |
| 2 | 20357 | 4.9% |
| 3 | 20176 | 4.8% |
| ( | 20091 | 4.8% |
| E | 20091 | 4.8% |
| Other values (31) | 89239 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 456 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 327413 |
| Missing (%) | 96.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 3.433481455 |
| Min length | 1 |
Unique
| Unique | 42 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | 500 |
|---|---|
| 2nd row | 500 |
| 3rd row | 140000 |
| 4th row | 100 |
| 5th row | 100 |
| Value | Count | Frequency (%) |
| 100 | 1572 | 14.3% |
| 5 | 436 | 4.0% |
| 14 | 402 | 3.6% |
| 12 | 386 | 3.5% |
| 500 | 366 | 3.3% |
| 10 | 311 | 2.8% |
| 32 | 277 | 2.5% |
| 200 | 273 | 2.5% |
| 15 | 256 | 2.3% |
| 23 | 231 | 2.1% |
| Other values (446) | 6517 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 8138 | |
| 1 | 6725 | |
| 2 | 4908 | |
| 5 | 3562 | |
| 4 | 3253 | 8.6% |
| 3 | 3020 | 8.0% |
| 6 | 2085 | 5.5% |
| 8 | 1726 | 4.6% |
| 9 | 1655 | 4.4% |
| 7 | 1538 | 4.1% |
| Other values (17) | 1251 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 36610 | |
| Other Punctuation | 1210 | 3.2% |
| Lowercase Letter | 37 | 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 6 | |
| t | 5 | |
| n | 4 | |
| c | 3 | |
| o | 3 | |
| p | 3 | |
| e | 3 | |
| r | 2 | 5.4% |
| y | 2 | 5.4% |
| g | 2 | 5.4% |
| Other values (3) | 4 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8138 | |
| 1 | 6725 | |
| 2 | 4908 | |
| 5 | 3562 | |
| 4 | 3253 | 8.9% |
| 3 | 3020 | 8.2% |
| 6 | 2085 | 5.7% |
| 8 | 1726 | 4.7% |
| 9 | 1655 | 4.5% |
| 7 | 1538 | 4.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| I | 1 | |
| E | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1210 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 37820 | |
| Latin | 41 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 6 | |
| t | 5 | |
| n | 4 | |
| c | 3 | 7.3% |
| o | 3 | 7.3% |
| p | 3 | 7.3% |
| e | 3 | 7.3% |
| A | 2 | 4.9% |
| r | 2 | 4.9% |
| y | 2 | 4.9% |
| Other values (6) | 8 |
Common
| Value | Count | Frequency (%) |
| 0 | 8138 | |
| 1 | 6725 | |
| 2 | 4908 | |
| 5 | 3562 | |
| 4 | 3253 | 8.6% |
| 3 | 3020 | 8.0% |
| 6 | 2085 | 5.5% |
| 8 | 1726 | 4.6% |
| 9 | 1655 | 4.4% |
| 7 | 1538 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37861 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 8138 | |
| 1 | 6725 | |
| 2 | 4908 | |
| 5 | 3562 | |
| 4 | 3253 | 8.6% |
| 3 | 3020 | 8.0% |
| 6 | 2085 | 5.5% |
| 8 | 1726 | 4.6% |
| 9 | 1655 | 4.4% |
| 7 | 1538 | 4.1% |
| Other values (17) | 1251 | 3.3% |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 12.5 |
| Mean length | 12.25 |
| Min length | 11 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Characiformes |
|---|---|
| 2nd row | Hoplonemertea |
| 3rd row | Siluriformes |
| 4th row | Lepidoptera |
| Value | Count | Frequency (%) |
| characiformes | 1 | |
| hoplonemertea | 1 | |
| siluriformes | 1 | |
| lepidoptera | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 7 | |
| r | 6 | |
| o | 5 | |
| a | 4 | 8.2% |
| i | 4 | 8.2% |
| m | 3 | 6.1% |
| p | 3 | 6.1% |
| s | 2 | 4.1% |
| t | 2 | 4.1% |
| f | 2 | 4.1% |
| Other values (10) | 11 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45 | |
| Uppercase Letter | 4 | 8.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 7 | |
| r | 6 | |
| o | 5 | |
| a | 4 | |
| i | 4 | |
| m | 3 | |
| p | 3 | |
| s | 2 | 4.4% |
| t | 2 | 4.4% |
| f | 2 | 4.4% |
| Other values (6) | 7 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1 | |
| S | 1 | |
| C | 1 | |
| H | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 7 | |
| r | 6 | |
| o | 5 | |
| a | 4 | 8.2% |
| i | 4 | 8.2% |
| m | 3 | 6.1% |
| p | 3 | 6.1% |
| s | 2 | 4.1% |
| t | 2 | 4.1% |
| f | 2 | 4.1% |
| Other values (10) | 11 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 7 | |
| r | 6 | |
| o | 5 | |
| a | 4 | 8.2% |
| i | 4 | 8.2% |
| m | 3 | 6.1% |
| p | 3 | 6.1% |
| s | 2 | 4.1% |
| t | 2 | 4.1% |
| f | 2 | 4.1% |
| Other values (10) | 11 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 12.5 |
| Mean length | 13.5 |
| Min length | 10 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Characidae |
|---|---|
| 2nd row | Ototyphlonemertidae |
| 3rd row | Callichthyidae |
| 4th row | Limacodidae |
| Value | Count | Frequency (%) |
| characidae | 1 | |
| ototyphlonemertidae | 1 | |
| callichthyidae | 1 | |
| limacodidae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| e | 6 | |
| d | 5 | |
| h | 4 | 7.4% |
| t | 4 | 7.4% |
| c | 3 | 5.6% |
| o | 3 | 5.6% |
| l | 3 | 5.6% |
| C | 2 | 3.7% |
| Other values (7) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 50 | |
| Uppercase Letter | 4 | 7.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| e | 6 | |
| d | 5 | |
| h | 4 | |
| t | 4 | |
| c | 3 | 6.0% |
| o | 3 | 6.0% |
| l | 3 | 6.0% |
| r | 2 | 4.0% |
| Other values (4) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| O | 1 | |
| L | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 54 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| e | 6 | |
| d | 5 | |
| h | 4 | 7.4% |
| t | 4 | 7.4% |
| c | 3 | 5.6% |
| o | 3 | 5.6% |
| l | 3 | 5.6% |
| C | 2 | 3.7% |
| Other values (7) | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| i | 6 | |
| e | 6 | |
| d | 5 | |
| h | 4 | 7.4% |
| t | 4 | 7.4% |
| c | 3 | 5.6% |
| o | 3 | 5.6% |
| l | 3 | 5.6% |
| C | 2 | 3.7% |
| Other values (7) | 10 |
verbatimLatitude
Text
Missing 
| Distinct | 7618 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 230082 |
| Missing (%) | 68.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 29 |
| Mean length | 9.909577512 |
| Min length | 1 |
Unique
| Unique | 1979 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | 27.57721471 |
|---|---|
| 2nd row | -17.47564 |
| 3rd row | 27 25.347 N |
| 4th row | 17 28 57.5 S |
| 5th row | 36.22739648 |
| Value | Count | Frequency (%) |
| n | 36304 | 14.9% |
| s | 13596 | 5.6% |
| 17 | 3921 | 1.6% |
| 12 | 3357 | 1.4% |
| 27 | 3322 | 1.4% |
| 36 | 3186 | 1.3% |
| 3135 | 1.3% | |
| 35 | 3063 | 1.3% |
| 16 | 3017 | 1.2% |
| 38 | 2723 | 1.1% |
| Other values (6455) | 168722 |
Most occurring characters
| Value | Count | Frequency (%) |
| 135988 | ||
| 1 | 101490 | |
| 3 | 91003 | |
| 2 | 86585 | 8.1% |
| . | 84857 | 7.9% |
| 4 | 78858 | 7.3% |
| 7 | 73653 | 6.9% |
| 0 | 73576 | 6.9% |
| 5 | 70329 | 6.5% |
| 8 | 59987 | 5.6% |
| Other values (30) | 217456 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 748860 | |
| Space Separator | 135988 | 12.7% |
| Other Punctuation | 97236 | 9.1% |
| Uppercase Letter | 58993 | 5.5% |
| Dash Punctuation | 29085 | 2.7% |
| Other Symbol | 3208 | 0.3% |
| Lowercase Letter | 387 | < 0.1% |
| Modifier Letter | 22 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
| Other Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 101490 | |
| 3 | 91003 | |
| 2 | 86585 | |
| 4 | 78858 | |
| 7 | 73653 | |
| 0 | 73576 | |
| 5 | 70329 | |
| 8 | 59987 | |
| 9 | 56962 | |
| 6 | 56417 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 122 | |
| d | 116 | |
| g | 116 | |
| a | 13 | 3.4% |
| r | 5 | 1.3% |
| n | 5 | 1.3% |
| t | 5 | 1.3% |
| c | 3 | 0.8% |
| s | 2 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 84857 | |
| ; | 4725 | 4.9% |
| ' | 4356 | 4.5% |
| " | 3260 | 3.4% |
| : | 26 | < 0.1% |
| , | 6 | < 0.1% |
| ? | 4 | < 0.1% |
| / | 2 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 43888 | |
| S | 15037 | 25.5% |
| W | 58 | 0.1% |
| M | 5 | < 0.1% |
| L | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 29059 | |
| – | 26 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 135988 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3208 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 22 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˚ | 1 |
Other Letter
| Value | Count | Frequency (%) |
| º | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1014401 | |
| Latin | 59381 | 5.5% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 135988 | ||
| 1 | 101490 | |
| 3 | 91003 | |
| 2 | 86585 | |
| . | 84857 | |
| 4 | 78858 | |
| 7 | 73653 | |
| 0 | 73576 | |
| 5 | 70329 | |
| 8 | 59987 | 5.9% |
| Other values (15) | 158075 |
Latin
| Value | Count | Frequency (%) |
| N | 43888 | |
| S | 15037 | 25.3% |
| e | 122 | 0.2% |
| d | 116 | 0.2% |
| g | 116 | 0.2% |
| W | 58 | 0.1% |
| a | 13 | < 0.1% |
| r | 5 | < 0.1% |
| n | 5 | < 0.1% |
| M | 5 | < 0.1% |
| Other values (5) | 16 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1070523 | |
| None | 3209 | 0.3% |
| Punctuation | 27 | < 0.1% |
| Modifier Letters | 23 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 135988 | ||
| 1 | 101490 | |
| 3 | 91003 | |
| 2 | 86585 | |
| . | 84857 | 7.9% |
| 4 | 78858 | 7.4% |
| 7 | 73653 | 6.9% |
| 0 | 73576 | 6.9% |
| 5 | 70329 | 6.6% |
| 8 | 59987 | 5.6% |
| Other values (24) | 214197 |
None
| Value | Count | Frequency (%) |
| ° | 3208 | |
| º | 1 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 26 | |
| ” | 1 | 3.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 22 | |
| ˚ | 1 | 4.3% |
Missing 
| Distinct | 7632 |
|---|---|
| Distinct (%) | 7.0% |
| Missing | 230109 |
| Missing (%) | 68.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 255 |
|---|---|
| Median length | 32 |
| Mean length | 10.73394504 |
| Min length | 2 |
Unique
| Unique | 1960 ? |
|---|---|
| Unique (%) | 1.8% |
Sample
| 1st row | -111.4495292 |
|---|---|
| 2nd row | -149.84247 |
| 3rd row | 79 56.156 W |
| 4th row | 149 53 59.6 W |
| 5th row | -122.879564 |
| Value | Count | Frequency (%) |
| w | 38487 | 15.8% |
| e | 11389 | 4.7% |
| 3089 | 1.3% | |
| 149 | 2722 | 1.1% |
| 53 | 2149 | 0.9% |
| 68 | 1810 | 0.7% |
| 075 | 1719 | 0.7% |
| 79 | 1634 | 0.7% |
| 55 | 1499 | 0.6% |
| 77 | 1463 | 0.6% |
| Other values (6616) | 178159 |
Most occurring characters
| Value | Count | Frequency (%) |
| 135789 | ||
| 1 | 112912 | |
| 0 | 89717 | 7.7% |
| 8 | 85734 | 7.4% |
| . | 84821 | 7.3% |
| 5 | 83305 | 7.2% |
| 7 | 79392 | 6.8% |
| 2 | 77607 | 6.7% |
| 4 | 76922 | 6.6% |
| 9 | 73881 | 6.4% |
| Other values (31) | 262739 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 809633 | |
| Space Separator | 135789 | 11.7% |
| Other Punctuation | 97302 | 8.4% |
| Uppercase Letter | 59020 | 5.1% |
| Dash Punctuation | 57456 | 4.9% |
| Other Symbol | 3208 | 0.3% |
| Lowercase Letter | 385 | < 0.1% |
| Modifier Letter | 22 | < 0.1% |
| Final Punctuation | 2 | < 0.1% |
| Modifier Symbol | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 112912 | |
| 0 | 89717 | |
| 8 | 85734 | |
| 5 | 83305 | |
| 7 | 79392 | |
| 2 | 77607 | |
| 4 | 76922 | |
| 9 | 73881 | |
| 3 | 71552 | |
| 6 | 58611 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 122 | |
| g | 116 | |
| d | 116 | |
| n | 10 | 2.6% |
| a | 7 | 1.8% |
| r | 5 | 1.3% |
| o | 5 | 1.3% |
| c | 2 | 0.5% |
| s | 2 | 0.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 84821 | |
| ; | 4611 | 4.7% |
| ' | 4322 | 4.4% |
| " | 3255 | 3.3% |
| # | 255 | 0.3% |
| : | 26 | < 0.1% |
| , | 8 | < 0.1% |
| ? | 4 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 47002 | |
| E | 11954 | 20.3% |
| S | 49 | 0.1% |
| N | 5 | < 0.1% |
| M | 5 | < 0.1% |
| L | 5 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57430 | |
| – | 26 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 135789 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 3208 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʹ | 22 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ˚ | 1 |
Other Letter
| Value | Count | Frequency (%) |
| º | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1103413 | |
| Latin | 59406 | 5.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 135789 | ||
| 1 | 112912 | |
| 0 | 89717 | |
| 8 | 85734 | |
| . | 84821 | |
| 5 | 83305 | |
| 7 | 79392 | 7.2% |
| 2 | 77607 | 7.0% |
| 4 | 76922 | 7.0% |
| 9 | 73881 | 6.7% |
| Other values (15) | 203333 |
Latin
| Value | Count | Frequency (%) |
| W | 47002 | |
| E | 11954 | 20.1% |
| e | 122 | 0.2% |
| g | 116 | 0.2% |
| d | 116 | 0.2% |
| S | 49 | 0.1% |
| n | 10 | < 0.1% |
| a | 7 | < 0.1% |
| N | 5 | < 0.1% |
| r | 5 | < 0.1% |
| Other values (6) | 20 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1159559 | |
| None | 3209 | 0.3% |
| Punctuation | 28 | < 0.1% |
| Modifier Letters | 23 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 135789 | ||
| 1 | 112912 | |
| 0 | 89717 | 7.7% |
| 8 | 85734 | 7.4% |
| . | 84821 | 7.3% |
| 5 | 83305 | 7.2% |
| 7 | 79392 | 6.8% |
| 2 | 77607 | 6.7% |
| 4 | 76922 | 6.6% |
| 9 | 73881 | 6.4% |
| Other values (25) | 259479 |
None
| Value | Count | Frequency (%) |
| ° | 3208 | |
| º | 1 | < 0.1% |
Punctuation
| Value | Count | Frequency (%) |
| – | 26 | |
| ” | 2 | 7.1% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʹ | 22 | |
| ˚ | 1 | 4.3% |
Missing 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 329369 |
| Missing (%) | 97.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.74655496 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 8924 | |
| minutes | 8849 | |
| seconds | 8849 | |
| township | 107 | 0.4% |
| range | 107 | 0.4% |
| decimal | 75 | 0.3% |
| utm | 24 | 0.1% |
| unknown | 16 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 44652 | |
| s | 26729 | |
| n | 17960 | 8.7% |
| 17880 | 8.7% | |
| g | 9031 | 4.4% |
| i | 9031 | 4.4% |
| D | 8988 | 4.4% |
| o | 8972 | 4.3% |
| c | 8924 | 4.3% |
| r | 8924 | 4.3% |
| Other values (15) | 45243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 161466 | |
| Uppercase Letter | 26988 | 13.1% |
| Space Separator | 17880 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 44652 | |
| s | 26729 | |
| n | 17960 | |
| g | 9031 | 5.6% |
| i | 9031 | 5.6% |
| o | 8972 | 5.6% |
| c | 8924 | 5.5% |
| r | 8924 | 5.5% |
| d | 8860 | 5.5% |
| u | 8849 | 5.5% |
| Other values (8) | 9534 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 8988 | |
| M | 8873 | |
| S | 8849 | |
| T | 131 | 0.5% |
| R | 107 | 0.4% |
| U | 40 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 17880 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 188454 | |
| Common | 17880 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 44652 | |
| s | 26729 | |
| n | 17960 | |
| g | 9031 | 4.8% |
| i | 9031 | 4.8% |
| D | 8988 | 4.8% |
| o | 8972 | 4.8% |
| c | 8924 | 4.7% |
| r | 8924 | 4.7% |
| M | 8873 | 4.7% |
| Other values (14) | 36370 |
Common
| Value | Count | Frequency (%) |
| 17880 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 206334 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 44652 | |
| s | 26729 | |
| n | 17960 | 8.7% |
| 17880 | 8.7% | |
| g | 9031 | 4.4% |
| i | 9031 | 4.4% |
| D | 8988 | 4.4% |
| o | 8972 | 4.3% |
| c | 8924 | 4.3% |
| r | 8924 | 4.3% |
| Other values (15) | 45243 |
verbatimSRS
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 10 |
| Mean length | 10.75 |
| Min length | 6 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Moenkhausia |
|---|---|
| 2nd row | Ototyphlonemertes |
| 3rd row | Corydoras |
| 4th row | Parasa |
| Value | Count | Frequency (%) |
| moenkhausia | 1 | |
| ototyphlonemertes | 1 | |
| corydoras | 1 | |
| parasa | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 5 | |
| e | 4 | 9.3% |
| s | 4 | 9.3% |
| r | 4 | 9.3% |
| t | 3 | 7.0% |
| n | 2 | 4.7% |
| h | 2 | 4.7% |
| y | 2 | 4.7% |
| M | 1 | 2.3% |
| Other values (10) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39 | |
| Uppercase Letter | 4 | 9.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 5 | |
| e | 4 | |
| s | 4 | |
| r | 4 | |
| t | 3 | |
| n | 2 | 5.1% |
| h | 2 | 5.1% |
| y | 2 | 5.1% |
| l | 1 | 2.6% |
| Other values (6) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 | |
| C | 1 | |
| O | 1 | |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 5 | |
| e | 4 | 9.3% |
| s | 4 | 9.3% |
| r | 4 | 9.3% |
| t | 3 | 7.0% |
| n | 2 | 4.7% |
| h | 2 | 4.7% |
| y | 2 | 4.7% |
| M | 1 | 2.3% |
| Other values (10) | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| o | 5 | |
| e | 4 | 9.3% |
| s | 4 | 9.3% |
| r | 4 | 9.3% |
| t | 3 | 7.0% |
| n | 2 | 4.7% |
| h | 2 | 4.7% |
| y | 2 | 4.7% |
| M | 1 | 2.3% |
| Other values (10) | 10 |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 19.75 |
| Min length | 16 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon nudivittis |
|---|---|
| 2nd row | Coccocypselum guianense |
| 3rd row | Emoia caeruleocauda |
| 4th row | Dimorphandra sp. |
| Value | Count | Frequency (%) |
| champsodon | 1 | |
| nudivittis | 1 | |
| coccocypselum | 1 | |
| guianense | 1 | |
| emoia | 1 | |
| caeruleocauda | 1 | |
| dimorphandra | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | 10.1% |
| o | 7 | 8.9% |
| i | 6 | 7.6% |
| c | 5 | 6.3% |
| s | 5 | 6.3% |
| n | 5 | 6.3% |
| u | 5 | 6.3% |
| e | 5 | 6.3% |
| m | 4 | 5.1% |
| p | 4 | 5.1% |
| Other values (13) | 25 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 70 | |
| Space Separator | 4 | 5.1% |
| Uppercase Letter | 4 | 5.1% |
| Other Punctuation | 1 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| o | 7 | |
| i | 6 | 8.6% |
| c | 5 | 7.1% |
| s | 5 | 7.1% |
| n | 5 | 7.1% |
| u | 5 | 7.1% |
| e | 5 | 7.1% |
| m | 4 | 5.7% |
| p | 4 | 5.7% |
| Other values (8) | 16 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 1 | |
| D | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 74 | |
| Common | 5 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | 10.8% |
| o | 7 | 9.5% |
| i | 6 | 8.1% |
| c | 5 | 6.8% |
| s | 5 | 6.8% |
| n | 5 | 6.8% |
| u | 5 | 6.8% |
| e | 5 | 6.8% |
| m | 4 | 5.4% |
| p | 4 | 5.4% |
| Other values (11) | 20 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| . | 1 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 79 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | 10.1% |
| o | 7 | 8.9% |
| i | 6 | 7.6% |
| c | 5 | 6.3% |
| s | 5 | 6.3% |
| n | 5 | 6.3% |
| u | 5 | 6.3% |
| e | 5 | 6.3% |
| m | 4 | 5.1% |
| p | 4 | 5.1% |
| Other values (13) | 25 |
georeferencedBy
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 9 |
| Min length | 7 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | hemigrammoides |
|---|---|
| 2nd row | pallida |
| 3rd row | bicolor |
| 4th row | hilarula |
| Value | Count | Frequency (%) |
| hemigrammoides | 1 | |
| pallida | 1 | |
| bicolor | 1 | |
| hilarula | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| l | 5 | |
| m | 3 | |
| r | 3 | |
| o | 3 | |
| h | 2 | 5.6% |
| e | 2 | 5.6% |
| d | 2 | 5.6% |
| g | 1 | 2.8% |
| Other values (5) | 5 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| l | 5 | |
| m | 3 | |
| r | 3 | |
| o | 3 | |
| h | 2 | 5.6% |
| e | 2 | 5.6% |
| d | 2 | 5.6% |
| g | 1 | 2.8% |
| Other values (5) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 36 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| l | 5 | |
| m | 3 | |
| r | 3 | |
| o | 3 | |
| h | 2 | 5.6% |
| e | 2 | 5.6% |
| d | 2 | 5.6% |
| g | 1 | 2.8% |
| Other values (5) | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 5 | |
| a | 5 | |
| l | 5 | |
| m | 3 | |
| r | 3 | |
| o | 3 | |
| h | 2 | 5.6% |
| e | 2 | 5.6% |
| d | 2 | 5.6% |
| g | 1 | 2.8% |
| Other values (5) | 5 |
Missing 
| Distinct | 172 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 255527 |
| Missing (%) | 75.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 228 |
|---|---|
| Median length | 12 |
| Mean length | 16.00867174 |
| Min length | 3 |
Unique
| Unique | 10 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Google Earth |
|---|---|
| 2nd row | Google Earth |
| 3rd row | Google Earth |
| 4th row | GeoLocate |
| 5th row | Google Earth |
| Value | Count | Frequency (%) |
| 50746 | ||
| earth | 44746 | |
| gps | 24192 | 11.6% |
| maps | 6426 | 3.1% |
| georeferencing | 4999 | 2.4% |
| and | 3624 | 1.7% |
| pro | 3253 | 1.6% |
| for | 3180 | 1.5% |
| to | 3180 | 1.5% |
| best | 3179 | 1.5% |
| Other values (336) | 60537 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 140045 | 10.6% |
| 125149 | 9.4% | |
| e | 116930 | 8.8% |
| r | 91937 | 6.9% |
| G | 90373 | 6.8% |
| a | 86896 | 6.5% |
| t | 73045 | 5.5% |
| g | 60345 | 4.5% |
| l | 58359 | 4.4% |
| h | 52505 | 4.0% |
| Other values (59) | 431743 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 893231 | |
| Uppercase Letter | 249335 | 18.8% |
| Space Separator | 125149 | 9.4% |
| Other Punctuation | 25688 | 1.9% |
| Decimal Number | 24656 | 1.9% |
| Close Punctuation | 4365 | 0.3% |
| Open Punctuation | 4365 | 0.3% |
| Dash Punctuation | 538 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 140045 | |
| e | 116930 | |
| r | 91937 | |
| a | 86896 | |
| t | 73045 | |
| g | 60345 | |
| l | 58359 | |
| h | 52505 | 5.9% |
| i | 37111 | 4.2% |
| n | 36414 | 4.1% |
| Other values (15) | 139644 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 90373 | |
| E | 47266 | |
| P | 31313 | 12.6% |
| S | 30537 | 12.2% |
| M | 8773 | 3.5% |
| N | 6265 | 2.5% |
| C | 5723 | 2.3% |
| I | 4595 | 1.8% |
| B | 3747 | 1.5% |
| W | 3527 | 1.4% |
| Other values (13) | 17216 | 6.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9145 | |
| 2 | 5815 | |
| 6 | 3905 | |
| 1 | 2229 | 9.0% |
| 7 | 1433 | 5.8% |
| 9 | 918 | 3.7% |
| 4 | 562 | 2.3% |
| 5 | 509 | 2.1% |
| 3 | 84 | 0.3% |
| 8 | 56 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 11882 | |
| , | 6287 | |
| / | 5944 | |
| : | 1370 | 5.3% |
| & | 153 | 0.6% |
| ! | 40 | 0.2% |
| ; | 12 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 125149 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4365 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4365 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 538 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1142566 | |
| Common | 184761 | 13.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 140045 | |
| e | 116930 | 10.2% |
| r | 91937 | 8.0% |
| G | 90373 | 7.9% |
| a | 86896 | 7.6% |
| t | 73045 | 6.4% |
| g | 60345 | 5.3% |
| l | 58359 | 5.1% |
| h | 52505 | 4.6% |
| E | 47266 | 4.1% |
| Other values (38) | 324865 |
Common
| Value | Count | Frequency (%) |
| 125149 | ||
| . | 11882 | 6.4% |
| 0 | 9145 | 4.9% |
| , | 6287 | 3.4% |
| / | 5944 | 3.2% |
| 2 | 5815 | 3.1% |
| ) | 4365 | 2.4% |
| ( | 4365 | 2.4% |
| 6 | 3905 | 2.1% |
| 1 | 2229 | 1.2% |
| Other values (11) | 5675 | 3.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1327327 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 140045 | 10.6% |
| 125149 | 9.4% | |
| e | 116930 | 8.8% |
| r | 91937 | 6.9% |
| G | 90373 | 6.8% |
| a | 86896 | 6.5% |
| t | 73045 | 5.5% |
| g | 60345 | 4.5% |
| l | 58359 | 4.4% |
| h | 52505 | 4.0% |
| Other values (59) | 431743 |
Missing 
| Distinct | 224 |
|---|---|
| Distinct (%) | 2.4% |
| Missing | 328933 |
| Missing (%) | 97.2% |
| Memory size | 2.6 MiB |
Length
| Max length | 83 |
|---|---|
| Median length | 51 |
| Mean length | 18.5300305 |
| Min length | 2 |
Unique
| Unique | 19 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Max error (m): 100 |
|---|---|
| 2nd row | Max error (m): 40 |
| 3rd row | Locality extent = 1.6 |
| 4th row | Locality extent = 1 mile |
| 5th row | Max error (m): 200 |
| Value | Count | Frequency (%) |
| m | 5357 | |
| max | 4970 | |
| error | 4970 | |
| 1992 | 5.4% | |
| locality | 1821 | 4.9% |
| extent | 1820 | 4.9% |
| 100 | 1766 | 4.8% |
| 50 | 916 | 2.5% |
| 200 | 740 | 2.0% |
| 4 | 668 | 1.8% |
| Other values (241) | 11826 |
Most occurring characters
| Value | Count | Frequency (%) |
| 27339 | ||
| r | 16685 | 9.5% |
| e | 10655 | 6.0% |
| o | 10235 | 5.8% |
| a | 10121 | 5.7% |
| t | 9615 | 5.5% |
| 0 | 8344 | 4.7% |
| x | 7007 | 4.0% |
| m | 6541 | 3.7% |
| n | 5380 | 3.1% |
| Other values (53) | 64243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 95057 | |
| Space Separator | 27339 | 15.5% |
| Decimal Number | 20326 | 11.5% |
| Uppercase Letter | 12887 | 7.3% |
| Other Punctuation | 8468 | 4.8% |
| Open Punctuation | 4973 | 2.8% |
| Close Punctuation | 4973 | 2.8% |
| Math Symbol | 1820 | 1.0% |
| Dash Punctuation | 322 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 16685 | |
| e | 10655 | |
| o | 10235 | |
| a | 10121 | |
| t | 9615 | |
| x | 7007 | |
| m | 6541 | 6.9% |
| n | 5380 | 5.7% |
| i | 4180 | 4.4% |
| l | 2812 | 3.0% |
| Other values (13) | 11826 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 4970 | |
| L | 1896 | 14.7% |
| S | 1114 | 8.6% |
| E | 1103 | 8.6% |
| W | 999 | 7.8% |
| G | 775 | 6.0% |
| C | 408 | 3.2% |
| H | 372 | 2.9% |
| V | 253 | 2.0% |
| R | 238 | 1.8% |
| Other values (9) | 759 | 5.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 8344 | |
| 1 | 3656 | |
| 5 | 2481 | 12.2% |
| 2 | 1520 | 7.5% |
| 4 | 1399 | 6.9% |
| 8 | 936 | 4.6% |
| 6 | 713 | 3.5% |
| 3 | 496 | 2.4% |
| 7 | 441 | 2.2% |
| 9 | 340 | 1.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 4970 | |
| . | 1939 | 22.9% |
| ; | 1214 | 14.3% |
| , | 311 | 3.7% |
| / | 31 | 0.4% |
| ' | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 27339 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4973 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4973 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1820 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 107944 | |
| Common | 68221 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 16685 | |
| e | 10655 | |
| o | 10235 | |
| a | 10121 | |
| t | 9615 | |
| x | 7007 | 6.5% |
| m | 6541 | 6.1% |
| n | 5380 | 5.0% |
| M | 4970 | 4.6% |
| i | 4180 | 3.9% |
| Other values (32) | 22555 |
Common
| Value | Count | Frequency (%) |
| 27339 | ||
| 0 | 8344 | 12.2% |
| ( | 4973 | 7.3% |
| ) | 4973 | 7.3% |
| : | 4970 | 7.3% |
| 1 | 3656 | 5.4% |
| 5 | 2481 | 3.6% |
| . | 1939 | 2.8% |
| = | 1820 | 2.7% |
| 2 | 1520 | 2.2% |
| Other values (11) | 6206 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 176165 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 27339 | ||
| r | 16685 | 9.5% |
| e | 10655 | 6.0% |
| o | 10235 | 5.8% |
| a | 10121 | 5.7% |
| t | 9615 | 5.5% |
| 0 | 8344 | 4.7% |
| x | 7007 | 4.0% |
| m | 6541 | 3.7% |
| n | 5380 | 3.1% |
| Other values (53) | 64243 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338439 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | (Keferstein) |
|---|
| Value | Count | Frequency (%) |
| keferstein | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3 | |
| ( | 1 | 8.3% |
| K | 1 | 8.3% |
| f | 1 | 8.3% |
| r | 1 | 8.3% |
| s | 1 | 8.3% |
| t | 1 | 8.3% |
| i | 1 | 8.3% |
| n | 1 | 8.3% |
| ) | 1 | 8.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 | |
| Open Punctuation | 1 | 8.3% |
| Uppercase Letter | 1 | 8.3% |
| Close Punctuation | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3 | |
| f | 1 | 11.1% |
| r | 1 | 11.1% |
| s | 1 | 11.1% |
| t | 1 | 11.1% |
| i | 1 | 11.1% |
| n | 1 | 11.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10 | |
| Common | 2 | 16.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3 | |
| K | 1 | 10.0% |
| f | 1 | 10.0% |
| r | 1 | 10.0% |
| s | 1 | 10.0% |
| t | 1 | 10.0% |
| i | 1 | 10.0% |
| n | 1 | 10.0% |
Common
| Value | Count | Frequency (%) |
| ( | 1 | |
| ) | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3 | |
| ( | 1 | 8.3% |
| K | 1 | 8.3% |
| f | 1 | 8.3% |
| r | 1 | 8.3% |
| s | 1 | 8.3% |
| t | 1 | 8.3% |
| i | 1 | 8.3% |
| n | 1 | 8.3% |
| ) | 1 | 8.3% |
earliestEonOrLowestEonothem
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 134 |
|---|---|
| Median length | 71 |
| Mean length | 83.5 |
| Min length | 58 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Osteichthyes, Actinopterygii, Neopterygii, Acanthopterygii, Perciformes, Trachinoidei, Champsodontidae |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Gentianales, Rubiaceae, Rubioideae |
| 3rd row | Animalia, Chordata, Vertebrata, Reptilia, Squamata, Sauria, Scincidae, Eugongylinae |
| 4th row | Plantae, Dicotyledonae, Fabales, Fabaceae, Caesalpinioideae |
| Value | Count | Frequency (%) |
| animalia | 2 | 7.1% |
| plantae | 2 | 7.1% |
| chordata | 2 | 7.1% |
| dicotyledonae | 2 | 7.1% |
| vertebrata | 2 | 7.1% |
| actinopterygii | 1 | 3.6% |
| rubioideae | 1 | 3.6% |
| fabaceae | 1 | 3.6% |
| fabales | 1 | 3.6% |
| eugongylinae | 1 | 3.6% |
| Other values (13) | 13 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | 10.5% |
| i | 32 | 9.6% |
| 24 | 7.2% | |
| , | 24 | 7.2% |
| t | 21 | 6.3% |
| n | 16 | 4.8% |
| o | 16 | 4.8% |
| r | 13 | 3.9% |
| l | 11 | 3.3% |
| Other values (25) | 99 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 258 | |
| Uppercase Letter | 28 | 8.4% |
| Space Separator | 24 | 7.2% |
| Other Punctuation | 24 | 7.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | |
| i | 32 | |
| t | 21 | |
| n | 16 | 6.2% |
| o | 16 | 6.2% |
| r | 13 | 5.0% |
| l | 11 | 4.3% |
| c | 11 | 4.3% |
| d | 10 | 3.9% |
| Other values (10) | 50 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4 | |
| C | 4 | |
| S | 3 | |
| P | 3 | |
| R | 3 | |
| D | 2 | |
| F | 2 | |
| V | 2 | |
| T | 1 | 3.6% |
| G | 1 | 3.6% |
| Other values (3) | 3 |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 286 | |
| Common | 48 | 14.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | |
| i | 32 | |
| t | 21 | 7.3% |
| n | 16 | 5.6% |
| o | 16 | 5.6% |
| r | 13 | 4.5% |
| l | 11 | 3.8% |
| c | 11 | 3.8% |
| d | 10 | 3.5% |
| Other values (23) | 78 |
Common
| Value | Count | Frequency (%) |
| 24 | ||
| , | 24 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 334 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 43 | |
| e | 35 | 10.5% |
| i | 32 | 9.6% |
| 24 | 7.2% | |
| , | 24 | 7.2% |
| t | 21 | 6.3% |
| n | 16 | 4.8% |
| o | 16 | 4.8% |
| r | 13 | 3.9% |
| l | 11 | 3.3% |
| Other values (25) | 99 |
latestEonOrHighestEonothem
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7.5 |
| Mean length | 7.5 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Plantae |
| 3rd row | Animalia |
| 4th row | Plantae |
| Value | Count | Frequency (%) |
| animalia | 2 | |
| plantae | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| A | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 2 | 6.7% |
| t | 2 | 6.7% |
| e | 2 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 26 | |
| Uppercase Letter | 4 | 13.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| m | 2 | 7.7% |
| t | 2 | 7.7% |
| e | 2 | 7.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| P | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 30 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| A | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 2 | 6.7% |
| t | 2 | 6.7% |
| e | 2 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| n | 4 | |
| i | 4 | |
| l | 4 | |
| A | 2 | 6.7% |
| m | 2 | 6.7% |
| P | 2 | 6.7% |
| t | 2 | 6.7% |
| e | 2 | 6.7% |
earliestEraOrLowestErathem
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 50.0% |
| Missing | 338438 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Chordata |
| Value | Count | Frequency (%) |
| chordata | 2 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4 | |
| C | 2 | |
| h | 2 | |
| o | 2 | |
| r | 2 | |
| d | 2 | |
| t | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 14 | |
| Uppercase Letter | 2 | 12.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4 | |
| h | 2 | |
| o | 2 | |
| r | 2 | |
| d | 2 | |
| t | 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4 | |
| C | 2 | |
| h | 2 | |
| o | 2 | |
| r | 2 | |
| d | 2 | |
| t | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4 | |
| C | 2 | |
| h | 2 | |
| o | 2 | |
| r | 2 | |
| d | 2 | |
| t | 2 |
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | 75.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 13.5 |
| Mean length | 12 |
| Min length | 8 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 50.0% |
Sample
| 1st row | Actinopterygii |
|---|---|
| 2nd row | Dicotyledonae |
| 3rd row | Reptilia |
| 4th row | Dicotyledonae |
| Value | Count | Frequency (%) |
| dicotyledonae | 2 | |
| actinopterygii | 1 | |
| reptilia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 6 | |
| o | 5 | |
| t | 5 | |
| c | 3 | 6.2% |
| y | 3 | 6.2% |
| l | 3 | 6.2% |
| n | 3 | 6.2% |
| a | 3 | 6.2% |
| D | 2 | 4.2% |
| Other values (6) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44 | |
| Uppercase Letter | 4 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 6 | |
| o | 5 | |
| t | 5 | |
| c | 3 | |
| y | 3 | |
| l | 3 | |
| n | 3 | |
| a | 3 | |
| d | 2 | 4.5% |
| Other values (3) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 2 | |
| A | 1 | |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 48 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 6 | |
| o | 5 | |
| t | 5 | |
| c | 3 | 6.2% |
| y | 3 | 6.2% |
| l | 3 | 6.2% |
| n | 3 | 6.2% |
| a | 3 | 6.2% |
| D | 2 | 4.2% |
| Other values (6) | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 7 | |
| e | 6 | |
| o | 5 | |
| t | 5 | |
| c | 3 | 6.2% |
| y | 3 | 6.2% |
| l | 3 | 6.2% |
| n | 3 | 6.2% |
| a | 3 | 6.2% |
| D | 2 | 4.2% |
| Other values (6) | 8 |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9.5 |
| Mean length | 9.25 |
| Min length | 7 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Perciformes |
|---|---|
| 2nd row | Gentianales |
| 3rd row | Squamata |
| 4th row | Fabales |
| Value | Count | Frequency (%) |
| perciformes | 1 | |
| gentianales | 1 | |
| squamata | 1 | |
| fabales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 5 | |
| s | 3 | 8.1% |
| r | 2 | 5.4% |
| i | 2 | 5.4% |
| m | 2 | 5.4% |
| n | 2 | 5.4% |
| t | 2 | 5.4% |
| l | 2 | 5.4% |
| P | 1 | 2.7% |
| Other values (9) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33 | |
| Uppercase Letter | 4 | 10.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 5 | |
| s | 3 | |
| r | 2 | 6.1% |
| i | 2 | 6.1% |
| m | 2 | 6.1% |
| n | 2 | 6.1% |
| t | 2 | 6.1% |
| l | 2 | 6.1% |
| u | 1 | 3.0% |
| Other values (5) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 | |
| S | 1 | |
| F | 1 | |
| G | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 5 | |
| s | 3 | 8.1% |
| r | 2 | 5.4% |
| i | 2 | 5.4% |
| m | 2 | 5.4% |
| n | 2 | 5.4% |
| t | 2 | 5.4% |
| l | 2 | 5.4% |
| P | 1 | 2.7% |
| Other values (9) | 9 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 37 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7 | |
| e | 5 | |
| s | 3 | 8.1% |
| r | 2 | 5.4% |
| i | 2 | 5.4% |
| m | 2 | 5.4% |
| n | 2 | 5.4% |
| t | 2 | 5.4% |
| l | 2 | 5.4% |
| P | 1 | 2.7% |
| Other values (9) | 9 |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 10.25 |
| Min length | 8 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodontidae |
|---|---|
| 2nd row | Rubiaceae |
| 3rd row | Scincidae |
| 4th row | Fabaceae |
| Value | Count | Frequency (%) |
| champsodontidae | 1 | |
| rubiaceae | 1 | |
| scincidae | 1 | |
| fabaceae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 7.3% |
| b | 2 | 4.9% |
| o | 2 | 4.9% |
| n | 2 | 4.9% |
| C | 1 | 2.4% |
| R | 1 | 2.4% |
| Other values (8) | 8 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 37 | |
| Uppercase Letter | 4 | 9.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 8.1% |
| b | 2 | 5.4% |
| o | 2 | 5.4% |
| n | 2 | 5.4% |
| u | 1 | 2.7% |
| t | 1 | 2.7% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| R | 1 | |
| S | 1 | |
| F | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 7.3% |
| b | 2 | 4.9% |
| o | 2 | 4.9% |
| n | 2 | 4.9% |
| C | 1 | 2.4% |
| R | 1 | 2.4% |
| Other values (8) | 8 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 41 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 6 | |
| c | 4 | |
| i | 4 | |
| d | 3 | 7.3% |
| b | 2 | 4.9% |
| o | 2 | 4.9% |
| n | 2 | 4.9% |
| C | 1 | 2.4% |
| R | 1 | 2.4% |
| Other values (8) | 8 |
lowestBiostratigraphicZone
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 10 |
| Min length | 5 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Champsodon |
|---|---|
| 2nd row | Coccocypselum |
| 3rd row | Emoia |
| 4th row | Dimorphandra |
| Value | Count | Frequency (%) |
| champsodon | 1 | |
| coccocypselum | 1 | |
| emoia | 1 | |
| dimorphandra | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 36 | |
| Uppercase Letter | 4 | 10.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | |
| m | 4 | |
| c | 3 | |
| p | 3 | |
| n | 2 | 5.6% |
| i | 2 | 5.6% |
| h | 2 | 5.6% |
| d | 2 | 5.6% |
| s | 2 | 5.6% |
| Other values (5) | 6 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2 | |
| E | 1 | |
| D | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 6 | |
| a | 4 | 10.0% |
| m | 4 | 10.0% |
| c | 3 | 7.5% |
| p | 3 | 7.5% |
| n | 2 | 5.0% |
| i | 2 | 5.0% |
| h | 2 | 5.0% |
| C | 2 | 5.0% |
| d | 2 | 5.0% |
| Other values (8) | 10 |
formation
Text
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 338436 |
| Missing (%) | > 99.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 9.5 |
| Mean length | 8.75 |
| Min length | 3 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | nudivittis |
|---|---|
| 2nd row | guianense |
| 3rd row | caeruleocauda |
| 4th row | sp. |
| Value | Count | Frequency (%) |
| nudivittis | 1 | |
| guianense | 1 | |
| caeruleocauda | 1 | |
| sp | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| s | 3 | |
| d | 2 | 5.7% |
| t | 2 | 5.7% |
| c | 2 | 5.7% |
| v | 1 | 2.9% |
| Other values (6) | 6 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 34 | |
| Other Punctuation | 1 | 2.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| s | 3 | |
| d | 2 | 5.9% |
| t | 2 | 5.9% |
| c | 2 | 5.9% |
| v | 1 | 2.9% |
| Other values (5) | 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34 | |
| Common | 1 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| s | 3 | |
| d | 2 | 5.9% |
| t | 2 | 5.9% |
| c | 2 | 5.9% |
| v | 1 | 2.9% |
| Other values (5) | 5 |
Common
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| u | 4 | |
| i | 4 | |
| a | 4 | |
| e | 4 | |
| n | 3 | |
| s | 3 | |
| d | 2 | 5.7% |
| t | 2 | 5.7% |
| c | 2 | 5.7% |
| v | 1 | 2.9% |
| Other values (6) | 6 |
Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 333367 |
| Missing (%) | 98.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 5.2578356 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | aff. |
|---|---|
| 2nd row | cf. |
| 3rd row | aff. |
| 4th row | uncertain |
| 5th row | uncertain |
| Value | Count | Frequency (%) |
| cf | 2742 | |
| uncertain | 1860 | |
| aff | 320 | 6.3% |
| near | 75 | 1.5% |
| complex | 38 | 0.7% |
| sp | 16 | 0.3% |
| group | 12 | 0.2% |
| n | 10 | 0.2% |
| nov | 6 | 0.1% |
| s.l | 5 | 0.1% |
| Other values (5) | 12 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 4635 | |
| n | 3811 | |
| f | 3382 | |
| . | 2735 | |
| a | 2239 | |
| e | 1978 | |
| r | 1947 | |
| t | 1860 | |
| i | 1860 | |
| u | 1846 | 6.9% |
| Other values (18) | 380 | 1.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23860 | |
| Other Punctuation | 2735 | 10.3% |
| Uppercase Letter | 53 | 0.2% |
| Space Separator | 23 | 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4635 | |
| n | 3811 | |
| f | 3382 | |
| a | 2239 | |
| e | 1978 | |
| r | 1947 | |
| t | 1860 | |
| i | 1860 | |
| u | 1846 | 7.7% |
| p | 66 | 0.3% |
| Other values (9) | 236 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 28 | |
| A | 17 | |
| C | 6 | 11.3% |
| K | 1 | 1.9% |
| S | 1 | 1.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2735 |
Space Separator
| Value | Count | Frequency (%) |
| 23 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23913 | |
| Common | 2760 | 10.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 4635 | |
| n | 3811 | |
| f | 3382 | |
| a | 2239 | |
| e | 1978 | |
| r | 1947 | |
| t | 1860 | |
| i | 1860 | |
| u | 1846 | 7.7% |
| p | 66 | 0.3% |
| Other values (14) | 289 | 1.2% |
Common
| Value | Count | Frequency (%) |
| . | 2735 | |
| 23 | 0.8% | |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26673 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 4635 | |
| n | 3811 | |
| f | 3382 | |
| . | 2735 | |
| a | 2239 | |
| e | 1978 | |
| r | 1947 | |
| t | 1860 | |
| i | 1860 | |
| u | 1846 | 6.9% |
| Other values (18) | 380 | 1.4% |
typeStatus
Text
Missing 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 331835 |
| Missing (%) | 98.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 8 |
| Mean length | 8.101438304 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paratype |
|---|---|
| 2nd row | Paratype |
| 3rd row | Paratype |
| 4th row | Paratype |
| 5th row | Paratype |
| Value | Count | Frequency (%) |
| paratype | 5799 | |
| holotype | 332 | 5.0% |
| paralectotype | 125 | 1.9% |
| cotype | 86 | 1.3% |
| syntype | 78 | 1.2% |
| type | 73 | 1.1% |
| of | 34 | 0.5% |
| paratopotype | 33 | 0.5% |
| allotype | 23 | 0.3% |
| ms | 22 | 0.3% |
| Other values (22) | 80 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11984 | |
| e | 6769 | |
| t | 6709 | |
| y | 6671 | |
| p | 6663 | |
| r | 5984 | |
| P | 5961 | |
| o | 1043 | 1.9% |
| l | 522 | 1.0% |
| H | 338 | 0.6% |
| Other values (29) | 866 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46739 | |
| Uppercase Letter | 6685 | 12.5% |
| Space Separator | 80 | 0.1% |
| Other Punctuation | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11984 | |
| e | 6769 | |
| t | 6709 | |
| y | 6671 | |
| p | 6663 | |
| r | 5984 | |
| o | 1043 | 2.2% |
| l | 522 | 1.1% |
| c | 148 | 0.3% |
| n | 93 | 0.2% |
| Other values (10) | 153 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 5961 | |
| H | 338 | 5.1% |
| T | 92 | 1.4% |
| C | 88 | 1.3% |
| S | 78 | 1.2% |
| O | 34 | 0.5% |
| M | 25 | 0.4% |
| A | 24 | 0.4% |
| N | 13 | 0.2% |
| L | 12 | 0.2% |
| Other values (6) | 20 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| ? | 4 | |
| ; | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 80 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 53424 | |
| Common | 86 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11984 | |
| e | 6769 | |
| t | 6709 | |
| y | 6671 | |
| p | 6663 | |
| r | 5984 | |
| P | 5961 | |
| o | 1043 | 2.0% |
| l | 522 | 1.0% |
| H | 338 | 0.6% |
| Other values (26) | 780 | 1.5% |
Common
| Value | Count | Frequency (%) |
| 80 | ||
| ? | 4 | 4.7% |
| ; | 2 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53510 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11984 | |
| e | 6769 | |
| t | 6709 | |
| y | 6671 | |
| p | 6663 | |
| r | 5984 | |
| P | 5961 | |
| o | 1043 | 1.9% |
| l | 522 | 1.0% |
| H | 338 | 0.6% |
| Other values (29) | 866 | 1.6% |
identifiedBy
Text
Missing 
| Distinct | 1866 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 226287 |
| Missing (%) | 66.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 150 |
|---|---|
| Median length | 128 |
| Mean length | 39.12073685 |
| Min length | 2 |
Unique
| Unique | 200 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Anker, Arthur |
|---|---|
| 2nd row | Osborn, Karen J., (IZ), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 3rd row | Baldwin, Carole C. |
| 4th row | Hobbs, Horton H., Jr., Smithsonian Institution, National Museum of Natural History |
| 5th row | Paulay, Gustav, University of Florida (UNITED STATES) |
| Value | Count | Frequency (%) |
| united | 36063 | 5.8% |
| states | 36020 | 5.8% |
| of | 27856 | 4.5% |
| smithsonian | 24359 | 3.9% |
| 22487 | 3.6% | |
| institution | 20525 | 3.3% |
| national | 18667 | 3.0% |
| museum | 17529 | 2.8% |
| natural | 17249 | 2.8% |
| history | 17170 | 2.8% |
| Other values (2280) | 384533 |
Most occurring characters
| Value | Count | Frequency (%) |
| 510305 | 11.6% | |
| i | 263200 | 6.0% |
| a | 260172 | 5.9% |
| t | 236846 | 5.4% |
| n | 236742 | 5.4% |
| o | 217779 | 5.0% |
| e | 199321 | 4.5% |
| , | 179315 | 4.1% |
| r | 173463 | 4.0% |
| s | 169712 | 3.9% |
| Other values (73) | 1940653 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2460341 | |
| Uppercase Letter | 994498 | |
| Space Separator | 510305 | 11.6% |
| Other Punctuation | 274335 | 6.3% |
| Close Punctuation | 61794 | 1.4% |
| Open Punctuation | 61794 | 1.4% |
| Dash Punctuation | 23913 | 0.5% |
| Decimal Number | 518 | < 0.1% |
| Initial Punctuation | 5 | < 0.1% |
| Final Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 263200 | |
| a | 260172 | |
| t | 236846 | |
| n | 236742 | |
| o | 217779 | |
| e | 199321 | |
| r | 173463 | 7.1% |
| s | 169712 | 6.9% |
| l | 137141 | 5.6% |
| u | 122744 | 5.0% |
| Other values (27) | 443221 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 131860 | |
| S | 126119 | |
| E | 89252 | 9.0% |
| N | 81544 | 8.2% |
| I | 69952 | 7.0% |
| A | 68290 | 6.9% |
| D | 60287 | 6.1% |
| U | 50369 | 5.1% |
| M | 44706 | 4.5% |
| B | 34118 | 3.4% |
| Other values (18) | 238001 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 179315 | |
| . | 89290 | |
| ; | 4450 | 1.6% |
| ' | 576 | 0.2% |
| & | 430 | 0.2% |
| / | 274 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 148 | |
| 4 | 74 | |
| 6 | 74 | |
| 0 | 74 | |
| 1 | 74 | |
| 9 | 74 |
Space Separator
| Value | Count | Frequency (%) |
| 510305 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 61794 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 61794 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 23913 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 5 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3454839 | |
| Common | 932669 | 21.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 263200 | 7.6% |
| a | 260172 | 7.5% |
| t | 236846 | 6.9% |
| n | 236742 | 6.9% |
| o | 217779 | 6.3% |
| e | 199321 | 5.8% |
| r | 173463 | 5.0% |
| s | 169712 | 4.9% |
| l | 137141 | 4.0% |
| T | 131860 | 3.8% |
| Other values (55) | 1428603 |
Common
| Value | Count | Frequency (%) |
| 510305 | ||
| , | 179315 | 19.2% |
| . | 89290 | 9.6% |
| ) | 61794 | 6.6% |
| ( | 61794 | 6.6% |
| - | 23913 | 2.6% |
| ; | 4450 | 0.5% |
| ' | 576 | 0.1% |
| & | 430 | < 0.1% |
| / | 274 | < 0.1% |
| Other values (8) | 528 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4386974 | |
| None | 524 | < 0.1% |
| Punctuation | 10 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 510305 | 11.6% | |
| i | 263200 | 6.0% |
| a | 260172 | 5.9% |
| t | 236846 | 5.4% |
| n | 236742 | 5.4% |
| o | 217779 | 5.0% |
| e | 199321 | 4.5% |
| , | 179315 | 4.1% |
| r | 173463 | 4.0% |
| s | 169712 | 3.9% |
| Other values (58) | 1940119 |
None
| Value | Count | Frequency (%) |
| í | 212 | |
| ö | 129 | |
| á | 99 | |
| ø | 29 | 5.5% |
| ú | 26 | 5.0% |
| ó | 12 | 2.3% |
| Ø | 7 | 1.3% |
| ë | 3 | 0.6% |
| è | 3 | 0.6% |
| ñ | 1 | 0.2% |
| Other values (3) | 3 | 0.6% |
Punctuation
| Value | Count | Frequency (%) |
| “ | 5 | |
| ” | 5 |
scientificName
Text
Missing 
| Distinct | 46019 |
|---|---|
| Distinct (%) | 14.6% |
| Missing | 24062 |
| Missing (%) | 7.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 85 |
|---|---|
| Median length | 63 |
| Mean length | 18.57501161 |
| Min length | 3 |
Unique
| Unique | 10046 ? |
|---|---|
| Unique (%) | 3.2% |
Sample
| 1st row | Rectiostoma fernaldella |
|---|---|
| 2nd row | Polystichum sp. |
| 3rd row | Mesontoplatys bolzi |
| 4th row | Bursa granularis |
| 5th row | Amanses scopas |
| Value | Count | Frequency (%) |
| sp | 50672 | 7.9% |
| plethodon | 4677 | 0.7% |
| orconectes | 4553 | 0.7% |
| indet | 4208 | 0.7% |
| procambarus | 3787 | 0.6% |
| unidentified | 3704 | 0.6% |
| bathymodiolus | 2599 | 0.4% |
| cinereus | 2327 | 0.4% |
| riftia | 2008 | 0.3% |
| truncatus | 1928 | 0.3% |
| Other values (42986) | 557516 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 627874 | 10.8% |
| i | 494432 | 8.5% |
| s | 465939 | 8.0% |
| e | 412196 | 7.1% |
| o | 370175 | 6.3% |
| r | 352777 | 6.0% |
| 323601 | 5.5% | |
| l | 300324 | 5.1% |
| n | 295293 | 5.1% |
| t | 291408 | 5.0% |
| Other values (69) | 1905556 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5135444 | |
| Space Separator | 323601 | 5.5% |
| Uppercase Letter | 317060 | 5.4% |
| Other Punctuation | 57396 | 1.0% |
| Open Punctuation | 2361 | < 0.1% |
| Close Punctuation | 2361 | < 0.1% |
| Decimal Number | 881 | < 0.1% |
| Connector Punctuation | 279 | < 0.1% |
| Dash Punctuation | 192 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 627874 | |
| i | 494432 | 9.6% |
| s | 465939 | 9.1% |
| e | 412196 | 8.0% |
| o | 370175 | 7.2% |
| r | 352777 | 6.9% |
| l | 300324 | 5.8% |
| n | 295293 | 5.8% |
| t | 291408 | 5.7% |
| u | 273496 | 5.3% |
| Other values (19) | 1251530 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 48132 | |
| C | 36686 | |
| A | 33497 | |
| S | 23837 | 7.5% |
| M | 18826 | 5.9% |
| E | 17864 | 5.6% |
| L | 16391 | 5.2% |
| H | 16038 | 5.1% |
| T | 14052 | 4.4% |
| D | 13559 | 4.3% |
| Other values (17) | 78178 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 270 | |
| 1 | 264 | |
| 2 | 99 | 11.2% |
| 3 | 66 | 7.5% |
| 6 | 56 | 6.4% |
| 7 | 46 | 5.2% |
| 8 | 26 | 3.0% |
| 4 | 23 | 2.6% |
| 5 | 17 | 1.9% |
| 9 | 14 | 1.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 56743 | |
| " | 306 | 0.5% |
| ' | 252 | 0.4% |
| , | 65 | 0.1% |
| / | 13 | < 0.1% |
| & | 11 | < 0.1% |
| ? | 5 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 323601 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2361 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2361 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 279 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 192 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5452504 | |
| Common | 387071 | 6.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 627874 | |
| i | 494432 | 9.1% |
| s | 465939 | 8.5% |
| e | 412196 | 7.6% |
| o | 370175 | 6.8% |
| r | 352777 | 6.5% |
| l | 300324 | 5.5% |
| n | 295293 | 5.4% |
| t | 291408 | 5.3% |
| u | 273496 | 5.0% |
| Other values (46) | 1568590 |
Common
| Value | Count | Frequency (%) |
| 323601 | ||
| . | 56743 | 14.7% |
| ( | 2361 | 0.6% |
| ) | 2361 | 0.6% |
| " | 306 | 0.1% |
| _ | 279 | 0.1% |
| 0 | 270 | 0.1% |
| 1 | 264 | 0.1% |
| ' | 252 | 0.1% |
| - | 192 | < 0.1% |
| Other values (13) | 442 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5839562 | |
| None | 13 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 627874 | 10.8% |
| i | 494432 | 8.5% |
| s | 465939 | 8.0% |
| e | 412196 | 7.1% |
| o | 370175 | 6.3% |
| r | 352777 | 6.0% |
| 323601 | 5.5% | |
| l | 300324 | 5.1% |
| n | 295293 | 5.1% |
| t | 291408 | 5.0% |
| Other values (65) | 1905543 |
None
| Value | Count | Frequency (%) |
| ë | 9 | |
| ö | 2 | 15.4% |
| Á | 1 | 7.7% |
| é | 1 | 7.7% |
Missing 
| Distinct | 4815 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 5901 |
| Missing (%) | 1.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 162 |
|---|---|
| Median length | 142 |
| Mean length | 76.5582473 |
| Min length | 6 |
Unique
| Unique | 458 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Arthropoda, Insecta, Lepidoptera, Depressariidae, Stenomatinae |
|---|---|
| 2nd row | Animalia, Annelida, Polychaeta, Sedentaria, Canalipalpata, Sabellida, Siboglinidae |
| 3rd row | Animalia, Annelida, Polychaeta, Errantia, Amphinomida, Amphinomidae |
| 4th row | Animalia, Arthropoda, Crustacea, Malacostraca, Eumalacostraca, Eucarida, Decapoda, Pleocyemata, Cambaridae |
| 5th row | Plantae, Pteridophyte, Polypodiales, Dryopteridaceae |
| Value | Count | Frequency (%) |
| animalia | 287708 | 13.0% |
| arthropoda | 145883 | 6.6% |
| insecta | 113237 | 5.1% |
| chordata | 103543 | 4.7% |
| vertebrata | 102503 | 4.6% |
| lepidoptera | 79773 | 3.6% |
| actinopterygii | 40747 | 1.8% |
| osteichthyes | 40745 | 1.8% |
| neopterygii | 40742 | 1.8% |
| plantae | 35547 | 1.6% |
| Other values (5328) | 1221137 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3312348 | |
| i | 2171272 | 8.5% |
| e | 2153801 | 8.5% |
| 1879026 | 7.4% | |
| , | 1876750 | 7.4% |
| t | 1539991 | 6.0% |
| r | 1526780 | 6.0% |
| o | 1482836 | 5.8% |
| n | 1001707 | 3.9% |
| d | 934382 | 3.7% |
| Other values (58) | 7579710 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 19488128 | |
| Uppercase Letter | 2209251 | 8.7% |
| Other Punctuation | 1880599 | 7.4% |
| Space Separator | 1879026 | 7.4% |
| Close Punctuation | 715 | < 0.1% |
| Open Punctuation | 715 | < 0.1% |
| Dash Punctuation | 127 | < 0.1% |
| Decimal Number | 30 | < 0.1% |
| Connector Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3312348 | |
| i | 2171272 | |
| e | 2153801 | |
| t | 1539991 | |
| r | 1526780 | |
| o | 1482836 | |
| n | 1001707 | 5.1% |
| d | 934382 | 4.8% |
| l | 865315 | 4.4% |
| c | 813387 | 4.2% |
| Other values (17) | 3686309 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 616961 | |
| C | 270329 | |
| P | 208454 | 9.4% |
| M | 125019 | 5.7% |
| I | 120936 | 5.5% |
| E | 116858 | 5.3% |
| L | 112485 | 5.1% |
| V | 112145 | 5.1% |
| S | 86438 | 3.9% |
| D | 72328 | 3.3% |
| Other values (16) | 367298 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9 | |
| 0 | 6 | |
| 1 | 6 | |
| 3 | 6 | |
| 9 | 3 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1876750 | |
| . | 3849 | 0.2% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 679 | |
| ] | 36 | 5.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 679 | |
| [ | 36 | 5.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| – | 124 | |
| - | 3 | 2.4% |
Space Separator
| Value | Count | Frequency (%) |
| 1879026 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21697379 | |
| Common | 3761224 | 14.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3312348 | |
| i | 2171272 | 10.0% |
| e | 2153801 | 9.9% |
| t | 1539991 | 7.1% |
| r | 1526780 | 7.0% |
| o | 1482836 | 6.8% |
| n | 1001707 | 4.6% |
| d | 934382 | 4.3% |
| l | 865315 | 4.0% |
| c | 813387 | 3.7% |
| Other values (43) | 5895560 |
Common
| Value | Count | Frequency (%) |
| 1879026 | ||
| , | 1876750 | |
| . | 3849 | 0.1% |
| ) | 679 | < 0.1% |
| ( | 679 | < 0.1% |
| – | 124 | < 0.1% |
| [ | 36 | < 0.1% |
| ] | 36 | < 0.1% |
| _ | 12 | < 0.1% |
| 6 | 9 | < 0.1% |
| Other values (5) | 24 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25458461 | |
| Punctuation | 124 | < 0.1% |
| None | 18 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3312348 | |
| i | 2171272 | 8.5% |
| e | 2153801 | 8.5% |
| 1879026 | 7.4% | |
| , | 1876750 | 7.4% |
| t | 1539991 | 6.0% |
| r | 1526780 | 6.0% |
| o | 1482836 | 5.8% |
| n | 1001707 | 3.9% |
| d | 934382 | 3.7% |
| Other values (56) | 7579568 |
Punctuation
| Value | Count | Frequency (%) |
| – | 124 |
None
| Value | Count | Frequency (%) |
| ö | 18 |
kingdom
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10613 |
| Missing (%) | 3.1% |
| Memory size | 2.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 8 |
| Mean length | 7.904840053 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Animalia |
| 4th row | Animalia |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| animalia | 287708 | |
| plantae | 35547 | 10.8% |
| chromista | 2994 | 0.9% |
| eubacteria | 1163 | 0.4% |
| fungi | 322 | 0.1% |
| protista | 42 | < 0.1% |
| metazoa | 24 | < 0.1% |
| eukaryota | 21 | < 0.1% |
| bacteria | 3 | < 0.1% |
| protozoa | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 651971 | |
| i | 579940 | |
| n | 323577 | |
| l | 323255 | |
| m | 290702 | |
| A | 287708 | |
| t | 39839 | 1.5% |
| e | 36737 | 1.4% |
| P | 35592 | 1.4% |
| r | 4226 | 0.2% |
| Other values (15) | 17873 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2263593 | |
| Uppercase Letter | 327827 | 12.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 651971 | |
| i | 579940 | |
| n | 323577 | |
| l | 323255 | |
| m | 290702 | |
| t | 39839 | 1.8% |
| e | 36737 | 1.6% |
| r | 4226 | 0.2% |
| o | 3090 | 0.1% |
| s | 3036 | 0.1% |
| Other values (8) | 7220 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 287708 | |
| P | 35592 | 10.9% |
| C | 2994 | 0.9% |
| E | 1184 | 0.4% |
| F | 322 | 0.1% |
| M | 24 | < 0.1% |
| B | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2591420 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 651971 | |
| i | 579940 | |
| n | 323577 | |
| l | 323255 | |
| m | 290702 | |
| A | 287708 | |
| t | 39839 | 1.5% |
| e | 36737 | 1.4% |
| P | 35592 | 1.4% |
| r | 4226 | 0.2% |
| Other values (15) | 17873 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2591420 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 651971 | |
| i | 579940 | |
| n | 323577 | |
| l | 323255 | |
| m | 290702 | |
| A | 287708 | |
| t | 39839 | 1.5% |
| e | 36737 | 1.4% |
| P | 35592 | 1.4% |
| r | 4226 | 0.2% |
| Other values (15) | 17873 | 0.7% |
phylum
Text
Missing 
| Distinct | 62 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 36740 |
| Missing (%) | 10.9% |
| Memory size | 2.6 MiB |
Length
| Max length | 33 |
|---|---|
| Median length | 25 |
| Mean length | 9.088574743 |
| Min length | 6 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Arthropoda |
|---|---|
| 2nd row | Annelida |
| 3rd row | Annelida |
| 4th row | Arthropoda |
| 5th row | Arthropoda |
| Value | Count | Frequency (%) |
| arthropoda | 145883 | |
| chordata | 103543 | |
| mollusca | 20757 | 6.9% |
| annelida | 11339 | 3.8% |
| cnidaria | 3181 | 1.1% |
| rhodophyta | 2943 | 1.0% |
| miozoa | 2074 | 0.7% |
| echinodermata | 1631 | 0.5% |
| chlorophyta | 1622 | 0.5% |
| porifera | 1250 | 0.4% |
| Other values (60) | 8028 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 438855 | |
| a | 414148 | |
| r | 409436 | |
| d | 269990 | |
| h | 264892 | |
| t | 263154 | |
| A | 157278 | 5.7% |
| p | 152337 | 5.6% |
| C | 109732 | 4.0% |
| l | 56835 | 2.1% |
| Other values (36) | 205366 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2439452 | |
| Uppercase Letter | 301700 | 11.0% |
| Space Separator | 551 | < 0.1% |
| Other Punctuation | 196 | < 0.1% |
| Dash Punctuation | 124 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 438855 | |
| a | 414148 | |
| r | 409436 | |
| d | 269990 | |
| h | 264892 | |
| t | 263154 | |
| p | 152337 | 6.2% |
| l | 56835 | 2.3% |
| n | 31280 | 1.3% |
| i | 27003 | 1.1% |
| Other values (14) | 111522 | 4.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 157278 | |
| C | 109732 | |
| M | 23023 | 7.6% |
| R | 2965 | 1.0% |
| P | 2470 | 0.8% |
| E | 1681 | 0.6% |
| N | 1362 | 0.5% |
| B | 1240 | 0.4% |
| O | 830 | 0.3% |
| S | 250 | 0.1% |
| Other values (9) | 869 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 551 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 196 |
Dash Punctuation
| Value | Count | Frequency (%) |
| – | 124 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2741152 | |
| Common | 871 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 438855 | |
| a | 414148 | |
| r | 409436 | |
| d | 269990 | |
| h | 264892 | |
| t | 263154 | |
| A | 157278 | 5.7% |
| p | 152337 | 5.6% |
| C | 109732 | 4.0% |
| l | 56835 | 2.1% |
| Other values (33) | 204495 |
Common
| Value | Count | Frequency (%) |
| 551 | ||
| . | 196 | 22.5% |
| – | 124 | 14.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2741899 | |
| Punctuation | 124 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 438855 | |
| a | 414148 | |
| r | 409436 | |
| d | 269990 | |
| h | 264892 | |
| t | 263154 | |
| A | 157278 | 5.7% |
| p | 152337 | 5.6% |
| C | 109732 | 4.0% |
| l | 56835 | 2.1% |
| Other values (35) | 205242 |
Punctuation
| Value | Count | Frequency (%) |
| – | 124 |
class
Text
Missing 
| Distinct | 112 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 12521 |
| Missing (%) | 3.7% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 19 |
| Mean length | 9.50620553 |
| Min length | 4 |
Unique
| Unique | 9 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Insecta |
|---|---|
| 2nd row | Polychaeta |
| 3rd row | Polychaeta |
| 4th row | Malacostraca |
| 5th row | Pteridophyte |
| Value | Count | Frequency (%) |
| insecta | 113237 | |
| actinopterygii | 40747 | 12.5% |
| malacostraca | 27947 | 8.6% |
| mammalia | 24499 | 7.5% |
| amphibia | 18405 | 5.6% |
| dicotyledonae | 15871 | 4.9% |
| monocotyledonae | 10880 | 3.3% |
| polychaeta | 10696 | 3.3% |
| reptilia | 9873 | 3.0% |
| bivalvia | 9780 | 3.0% |
| Other values (102) | 44674 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 446449 | |
| t | 292487 | 9.4% |
| e | 267717 | 8.6% |
| c | 260251 | 8.4% |
| i | 258346 | 8.3% |
| o | 207477 | 6.7% |
| n | 200830 | 6.5% |
| s | 163672 | 5.3% |
| l | 122646 | 4.0% |
| I | 113253 | 3.7% |
| Other values (38) | 765125 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2770286 | |
| Uppercase Letter | 325917 | 10.5% |
| Space Separator | 690 | < 0.1% |
| Close Punctuation | 678 | < 0.1% |
| Open Punctuation | 678 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 446449 | |
| t | 292487 | |
| e | 267717 | |
| c | 260251 | |
| i | 258346 | |
| o | 207477 | |
| n | 200830 | |
| s | 163672 | 5.9% |
| l | 122646 | 4.4% |
| p | 95959 | 3.5% |
| Other values (14) | 454452 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 113253 | |
| A | 73869 | |
| M | 65072 | |
| D | 17381 | 5.3% |
| P | 15685 | 4.8% |
| R | 10130 | 3.1% |
| B | 9824 | 3.0% |
| G | 9575 | 2.9% |
| F | 2547 | 0.8% |
| C | 2402 | 0.7% |
| Other values (10) | 6179 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 690 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 678 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 678 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3096203 | |
| Common | 2050 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 446449 | |
| t | 292487 | 9.4% |
| e | 267717 | 8.6% |
| c | 260251 | 8.4% |
| i | 258346 | 8.3% |
| o | 207477 | 6.7% |
| n | 200830 | 6.5% |
| s | 163672 | 5.3% |
| l | 122646 | 4.0% |
| I | 113253 | 3.7% |
| Other values (34) | 763075 |
Common
| Value | Count | Frequency (%) |
| 690 | ||
| ) | 678 | |
| ( | 678 | |
| . | 4 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3098253 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 446449 | |
| t | 292487 | 9.4% |
| e | 267717 | 8.6% |
| c | 260251 | 8.4% |
| i | 258346 | 8.3% |
| o | 207477 | 6.7% |
| n | 200830 | 6.5% |
| s | 163672 | 5.3% |
| l | 122646 | 4.0% |
| I | 113253 | 3.7% |
| Other values (38) | 765125 |
order
Text
Missing 
| Distinct | 532 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 30431 |
| Missing (%) | 9.0% |
| Memory size | 2.6 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 22 |
| Mean length | 9.884892974 |
| Min length | 5 |
Unique
| Unique | 51 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Lepidoptera |
|---|---|
| 2nd row | Sabellida |
| 3rd row | Amphinomida |
| 4th row | Decapoda |
| 5th row | Polypodiales |
| Value | Count | Frequency (%) |
| lepidoptera | 79773 | |
| perciformes | 26030 | 8.4% |
| decapoda | 23842 | 7.7% |
| coleoptera | 10156 | 3.3% |
| anura | 10022 | 3.3% |
| squamata | 9570 | 3.1% |
| hymenoptera | 8500 | 2.8% |
| rodentia | 8413 | 2.7% |
| caudata | 8212 | 2.7% |
| poales | 7860 | 2.6% |
| Other values (523) | 115676 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 429551 | |
| a | 377525 | |
| o | 262354 | 8.6% |
| r | 255846 | 8.4% |
| p | 249068 | 8.2% |
| i | 218439 | 7.2% |
| t | 178257 | 5.9% |
| d | 158092 | 5.2% |
| s | 110896 | 3.6% |
| l | 90314 | 3.0% |
| Other values (44) | 714294 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2735137 | |
| Uppercase Letter | 307980 | 10.1% |
| Other Punctuation | 1416 | < 0.1% |
| Space Separator | 45 | < 0.1% |
| Open Punctuation | 29 | < 0.1% |
| Close Punctuation | 29 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 429551 | |
| a | 377525 | |
| o | 262354 | |
| r | 255846 | |
| p | 249068 | |
| i | 218439 | |
| t | 178257 | |
| d | 158092 | 5.8% |
| s | 110896 | 4.1% |
| l | 90314 | 3.3% |
| Other values (16) | 404795 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 83861 | |
| P | 47799 | |
| C | 40251 | |
| D | 32936 | 10.7% |
| A | 25439 | 8.3% |
| S | 22700 | 7.4% |
| H | 15207 | 4.9% |
| R | 9970 | 3.2% |
| T | 5215 | 1.7% |
| M | 3808 | 1.2% |
| Other values (14) | 20794 | 6.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1416 |
Space Separator
| Value | Count | Frequency (%) |
| 45 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 29 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 29 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3043117 | |
| Common | 1519 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 429551 | |
| a | 377525 | |
| o | 262354 | 8.6% |
| r | 255846 | 8.4% |
| p | 249068 | 8.2% |
| i | 218439 | 7.2% |
| t | 178257 | 5.9% |
| d | 158092 | 5.2% |
| s | 110896 | 3.6% |
| l | 90314 | 3.0% |
| Other values (40) | 712775 |
Common
| Value | Count | Frequency (%) |
| . | 1416 | |
| 45 | 3.0% | |
| [ | 29 | 1.9% |
| ] | 29 | 1.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3044636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 429551 | |
| a | 377525 | |
| o | 262354 | 8.6% |
| r | 255846 | 8.4% |
| p | 249068 | 8.2% |
| i | 218439 | 7.2% |
| t | 178257 | 5.9% |
| d | 158092 | 5.2% |
| s | 110896 | 3.6% |
| l | 90314 | 3.0% |
| Other values (44) | 714294 |
family
Text
Missing 
| Distinct | 2911 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 18609 |
| Missing (%) | 5.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 19 |
| Mean length | 10.80786103 |
| Min length | 6 |
Unique
| Unique | 283 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Depressariidae |
|---|---|
| 2nd row | Siboglinidae |
| 3rd row | Amphinomidae |
| 4th row | Cambaridae |
| 5th row | Dryopteridaceae |
| Value | Count | Frequency (%) |
| cambaridae | 12182 | 3.8% |
| geometridae | 12034 | 3.8% |
| noctuidae | 7956 | 2.5% |
| tortricidae | 7260 | 2.3% |
| plethodontidae | 6792 | 2.1% |
| poaceae | 6686 | 2.1% |
| delphinidae | 5544 | 1.7% |
| pyralidae | 5230 | 1.6% |
| siboglinidae | 5015 | 1.6% |
| vesicomyidae | 4924 | 1.5% |
| Other values (2904) | 246243 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 524407 | |
| a | 508472 | |
| i | 444074 | |
| d | 313710 | 9.1% |
| r | 190392 | 5.5% |
| o | 187351 | 5.4% |
| c | 141788 | 4.1% |
| t | 127760 | 3.7% |
| l | 120634 | 3.5% |
| n | 111227 | 3.2% |
| Other values (52) | 786874 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3134548 | |
| Uppercase Letter | 319831 | 9.3% |
| Other Punctuation | 2231 | 0.1% |
| Space Separator | 35 | < 0.1% |
| Decimal Number | 30 | < 0.1% |
| Connector Punctuation | 12 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 524407 | |
| a | 508472 | |
| i | 444074 | |
| d | 313710 | |
| r | 190392 | 6.1% |
| o | 187351 | 6.0% |
| c | 141788 | 4.5% |
| t | 127760 | 4.1% |
| l | 120634 | 3.8% |
| n | 111227 | 3.5% |
| Other values (16) | 464733 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 51983 | |
| P | 45338 | |
| G | 29376 | |
| S | 26214 | 8.2% |
| A | 23086 | 7.2% |
| T | 18933 | 5.9% |
| M | 16536 | 5.2% |
| D | 15688 | 4.9% |
| N | 13489 | 4.2% |
| L | 13475 | 4.2% |
| Other values (16) | 65713 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9 | |
| 0 | 6 | |
| 1 | 6 | |
| 3 | 6 | |
| 9 | 3 | 10.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2231 |
Space Separator
| Value | Count | Frequency (%) |
| 35 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 12 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3454379 | |
| Common | 2310 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 524407 | |
| a | 508472 | |
| i | 444074 | |
| d | 313710 | 9.1% |
| r | 190392 | 5.5% |
| o | 187351 | 5.4% |
| c | 141788 | 4.1% |
| t | 127760 | 3.7% |
| l | 120634 | 3.5% |
| n | 111227 | 3.2% |
| Other values (42) | 784564 |
Common
| Value | Count | Frequency (%) |
| . | 2231 | |
| 35 | 1.5% | |
| _ | 12 | 0.5% |
| 6 | 9 | 0.4% |
| 0 | 6 | 0.3% |
| 1 | 6 | 0.3% |
| 3 | 6 | 0.3% |
| 9 | 3 | 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3456689 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 524407 | |
| a | 508472 | |
| i | 444074 | |
| d | 313710 | 9.1% |
| r | 190392 | 5.5% |
| o | 187351 | 5.4% |
| c | 141788 | 4.1% |
| t | 127760 | 3.7% |
| l | 120634 | 3.5% |
| n | 111227 | 3.2% |
| Other values (52) | 786874 |
genus
Text
Missing 
| Distinct | 19356 |
|---|---|
| Distinct (%) | 6.2% |
| Missing | 25827 |
| Missing (%) | 7.6% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 9.340145803 |
| Min length | 2 |
Unique
| Unique | 2069 ? |
|---|---|
| Unique (%) | 0.7% |
Sample
| 1st row | Rectiostoma |
|---|---|
| 2nd row | Polystichum |
| 3rd row | Mesontoplatys |
| 4th row | Bursa |
| 5th row | Amanses |
| Value | Count | Frequency (%) |
| plethodon | 4675 | 1.5% |
| orconectes | 4553 | 1.5% |
| indet | 4240 | 1.4% |
| procambarus | 3784 | 1.2% |
| unidentified | 3704 | 1.2% |
| bathymodiolus | 2599 | 0.8% |
| riftia | 2008 | 0.6% |
| tursiops | 1921 | 0.6% |
| cambarus | 1854 | 0.6% |
| delphinus | 1663 | 0.5% |
| Other values (19347) | 281620 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 311120 | 10.7% |
| o | 246546 | 8.4% |
| i | 226277 | 7.7% |
| e | 220426 | 7.5% |
| s | 205102 | 7.0% |
| r | 186671 | 6.4% |
| t | 155628 | 5.3% |
| n | 142188 | 4.9% |
| l | 139011 | 4.8% |
| u | 122527 | 4.2% |
| Other values (54) | 964355 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2602833 | |
| Uppercase Letter | 312570 | 10.7% |
| Other Punctuation | 4244 | 0.1% |
| Decimal Number | 126 | < 0.1% |
| Connector Punctuation | 60 | < 0.1% |
| Dash Punctuation | 10 | < 0.1% |
| Space Separator | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 311120 | |
| o | 246546 | 9.5% |
| i | 226277 | 8.7% |
| e | 220426 | 8.5% |
| s | 205102 | 7.9% |
| r | 186671 | 7.2% |
| t | 155628 | 6.0% |
| n | 142188 | 5.5% |
| l | 139011 | 5.3% |
| u | 122527 | 4.7% |
| Other values (17) | 647337 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 47429 | |
| C | 36228 | |
| A | 32996 | |
| S | 23260 | 7.4% |
| M | 18527 | 5.9% |
| E | 17691 | 5.7% |
| L | 16233 | 5.2% |
| H | 15833 | 5.1% |
| T | 13931 | 4.5% |
| D | 13470 | 4.3% |
| Other values (16) | 76972 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 54 | |
| 1 | 27 | |
| 2 | 12 | 9.5% |
| 3 | 12 | 9.5% |
| 4 | 9 | 7.1% |
| 6 | 9 | 7.1% |
| 9 | 3 | 2.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4244 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 60 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10 |
Space Separator
| Value | Count | Frequency (%) |
| 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2915403 | |
| Common | 4448 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 311120 | 10.7% |
| o | 246546 | 8.5% |
| i | 226277 | 7.8% |
| e | 220426 | 7.6% |
| s | 205102 | 7.0% |
| r | 186671 | 6.4% |
| t | 155628 | 5.3% |
| n | 142188 | 4.9% |
| l | 139011 | 4.8% |
| u | 122527 | 4.2% |
| Other values (43) | 959907 |
Common
| Value | Count | Frequency (%) |
| . | 4244 | |
| _ | 60 | 1.3% |
| 0 | 54 | 1.2% |
| 1 | 27 | 0.6% |
| 2 | 12 | 0.3% |
| 3 | 12 | 0.3% |
| - | 10 | 0.2% |
| 4 | 9 | 0.2% |
| 6 | 9 | 0.2% |
| 8 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2919842 | |
| None | 9 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 311120 | 10.7% |
| o | 246546 | 8.4% |
| i | 226277 | 7.7% |
| e | 220426 | 7.5% |
| s | 205102 | 7.0% |
| r | 186671 | 6.4% |
| t | 155628 | 5.3% |
| n | 142188 | 4.9% |
| l | 139011 | 4.8% |
| u | 122527 | 4.2% |
| Other values (53) | 964346 |
None
| Value | Count | Frequency (%) |
| ë | 9 |
subgenus
Text
Missing 
| Distinct | 293 |
|---|---|
| Distinct (%) | 12.7% |
| Missing | 336132 |
| Missing (%) | 99.3% |
| Memory size | 2.6 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 16 |
| Mean length | 10.68674177 |
| Min length | 3 |
Unique
| Unique | 46 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | Scapulicambarus |
|---|---|
| 2nd row | Amara |
| 3rd row | Anopheles |
| 4th row | Dipremna |
| 5th row | Abax |
| Value | Count | Frequency (%) |
| ortmannicus | 142 | 6.2% |
| pyrocera | 120 | 5.2% |
| aviticambarus | 78 | 3.4% |
| jugicambarus | 68 | 2.9% |
| creaserinus | 64 | 2.8% |
| pennides | 62 | 2.7% |
| girardiella | 56 | 2.4% |
| scapulicambarus | 47 | 2.0% |
| ochlerotatus | 42 | 1.8% |
| apiocera | 38 | 1.6% |
| Other values (283) | 1591 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3119 | |
| r | 2160 | 8.8% |
| i | 1917 | 7.8% |
| e | 1855 | 7.5% |
| s | 1832 | 7.4% |
| o | 1443 | 5.9% |
| c | 1332 | 5.4% |
| u | 1293 | 5.2% |
| n | 1221 | 5.0% |
| l | 1176 | 4.8% |
| Other values (38) | 7317 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22357 | |
| Uppercase Letter | 2308 | 9.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3119 | |
| r | 2160 | |
| i | 1917 | 8.6% |
| e | 1855 | 8.3% |
| s | 1832 | 8.2% |
| o | 1443 | 6.5% |
| c | 1332 | 6.0% |
| u | 1293 | 5.8% |
| n | 1221 | 5.5% |
| l | 1176 | 5.3% |
| Other values (15) | 5009 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 481 | |
| A | 287 | |
| C | 245 | |
| O | 200 | |
| M | 184 | 8.0% |
| S | 128 | 5.5% |
| H | 120 | 5.2% |
| E | 97 | 4.2% |
| G | 91 | 3.9% |
| D | 76 | 3.3% |
| Other values (13) | 399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 24665 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3119 | |
| r | 2160 | 8.8% |
| i | 1917 | 7.8% |
| e | 1855 | 7.5% |
| s | 1832 | 7.4% |
| o | 1443 | 5.9% |
| c | 1332 | 5.4% |
| u | 1293 | 5.2% |
| n | 1221 | 5.0% |
| l | 1176 | 4.8% |
| Other values (38) | 7317 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24665 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3119 | |
| r | 2160 | 8.8% |
| i | 1917 | 7.8% |
| e | 1855 | 7.5% |
| s | 1832 | 7.4% |
| o | 1443 | 5.9% |
| c | 1332 | 5.4% |
| u | 1293 | 5.2% |
| n | 1221 | 5.0% |
| l | 1176 | 4.8% |
| Other values (38) | 7317 |
specificEpithet
Text
Missing 
| Distinct | 23245 |
|---|---|
| Distinct (%) | 7.6% |
| Missing | 33273 |
| Missing (%) | 9.8% |
| Memory size | 2.6 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 19 |
| Mean length | 7.933020281 |
| Min length | 2 |
Unique
| Unique | 3387 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | fernaldella |
|---|---|
| 2nd row | sp. |
| 3rd row | bolzi |
| 4th row | granularis |
| 5th row | scopas |
| Value | Count | Frequency (%) |
| sp | 49913 | 16.3% |
| truncatus | 1928 | 0.6% |
| cinereus | 1834 | 0.6% |
| delphis | 1661 | 0.5% |
| porphyriticus | 816 | 0.3% |
| acutus | 779 | 0.3% |
| opacum | 765 | 0.3% |
| hoffmani | 640 | 0.2% |
| maculatus | 635 | 0.2% |
| nigripes | 624 | 0.2% |
| Other values (23227) | 245891 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 298090 | |
| i | 250360 | |
| s | 244354 | 10.1% |
| e | 177113 | 7.3% |
| r | 154433 | 6.4% |
| l | 153485 | 6.3% |
| u | 141392 | 5.8% |
| n | 141311 | 5.8% |
| t | 129210 | 5.3% |
| p | 116548 | 4.8% |
| Other values (38) | 614600 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2369241 | |
| Other Punctuation | 50232 | 2.1% |
| Decimal Number | 705 | < 0.1% |
| Space Separator | 319 | < 0.1% |
| Connector Punctuation | 219 | < 0.1% |
| Dash Punctuation | 176 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 298090 | |
| i | 250360 | |
| s | 244354 | |
| e | 177113 | 7.5% |
| r | 154433 | 6.5% |
| l | 153485 | 6.5% |
| u | 141392 | 6.0% |
| n | 141311 | 6.0% |
| t | 129210 | 5.5% |
| p | 116548 | 4.9% |
| Other values (16) | 562945 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 230 | |
| 0 | 202 | |
| 2 | 74 | 10.5% |
| 3 | 50 | 7.1% |
| 6 | 47 | 6.7% |
| 7 | 42 | 6.0% |
| 8 | 24 | 3.4% |
| 5 | 16 | 2.3% |
| 9 | 10 | 1.4% |
| 4 | 10 | 1.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 49950 | |
| " | 260 | 0.5% |
| / | 10 | < 0.1% |
| ? | 5 | < 0.1% |
| , | 4 | < 0.1% |
| ' | 2 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 319 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 219 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 176 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2369241 | |
| Common | 51655 | 2.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 298090 | |
| i | 250360 | |
| s | 244354 | |
| e | 177113 | 7.5% |
| r | 154433 | 6.5% |
| l | 153485 | 6.5% |
| u | 141392 | 6.0% |
| n | 141311 | 6.0% |
| t | 129210 | 5.5% |
| p | 116548 | 4.9% |
| Other values (16) | 562945 |
Common
| Value | Count | Frequency (%) |
| . | 49950 | |
| 319 | 0.6% | |
| " | 260 | 0.5% |
| 1 | 230 | 0.4% |
| _ | 219 | 0.4% |
| 0 | 202 | 0.4% |
| - | 176 | 0.3% |
| 2 | 74 | 0.1% |
| 3 | 50 | 0.1% |
| 6 | 47 | 0.1% |
| Other values (12) | 128 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2420896 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 298090 | |
| i | 250360 | |
| s | 244354 | 10.1% |
| e | 177113 | 7.3% |
| r | 154433 | 6.4% |
| l | 153485 | 6.3% |
| u | 141392 | 5.8% |
| n | 141311 | 5.8% |
| t | 129210 | 5.3% |
| p | 116548 | 4.8% |
| Other values (38) | 614600 |
Missing 
| Distinct | 1864 |
|---|---|
| Distinct (%) | 15.8% |
| Missing | 326664 |
| Missing (%) | 96.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 9.024456522 |
| Min length | 3 |
Unique
| Unique | 736 ? |
|---|---|
| Unique (%) | 6.2% |
Sample
| 1st row | cinereus |
|---|---|
| 2nd row | benjamina |
| 3rd row | mexicana |
| 4th row | doliatus |
| 5th row | pallidirostris |
| Value | Count | Frequency (%) |
| pennsylvanicus | 615 | 5.2% |
| cinereus | 493 | 4.2% |
| insignis | 267 | 2.3% |
| melas | 246 | 2.1% |
| talpoides | 246 | 2.1% |
| noveboracensis | 196 | 1.7% |
| dickeyi | 167 | 1.4% |
| dorsalis | 125 | 1.1% |
| cherriei | 124 | 1.1% |
| sacarensis | 107 | 0.9% |
| Other values (1857) | 9195 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 12488 | |
| s | 11660 | |
| a | 11265 | |
| e | 9324 | |
| n | 8882 | 8.4% |
| r | 6808 | 6.4% |
| u | 6451 | 6.1% |
| c | 5916 | 5.6% |
| l | 5451 | 5.1% |
| o | 5239 | 4.9% |
| Other values (21) | 22788 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 106259 | |
| Space Separator | 5 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 12488 | |
| s | 11660 | |
| a | 11265 | |
| e | 9324 | |
| n | 8882 | 8.4% |
| r | 6808 | 6.4% |
| u | 6451 | 6.1% |
| c | 5916 | 5.6% |
| l | 5451 | 5.1% |
| o | 5239 | 4.9% |
| Other values (16) | 22775 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 106259 | |
| Common | 13 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 12488 | |
| s | 11660 | |
| a | 11265 | |
| e | 9324 | |
| n | 8882 | 8.4% |
| r | 6808 | 6.4% |
| u | 6451 | 6.1% |
| c | 5916 | 5.6% |
| l | 5451 | 5.1% |
| o | 5239 | 4.9% |
| Other values (16) | 22775 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| - | 4 | |
| . | 2 | 15.4% |
| ( | 1 | 7.7% |
| ) | 1 | 7.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 106272 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 12488 | |
| s | 11660 | |
| a | 11265 | |
| e | 9324 | |
| n | 8882 | 8.4% |
| r | 6808 | 6.4% |
| u | 6451 | 6.1% |
| c | 5916 | 5.6% |
| l | 5451 | 5.1% |
| o | 5239 | 4.9% |
| Other values (21) | 22788 |
taxonRank
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 326679 |
| Missing (%) | 96.5% |
| Memory size | 2.6 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.75733356 |
| Min length | 3 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | subspecies |
|---|---|
| 2nd row | variety |
| 3rd row | subspecies |
| 4th row | subspecies |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 10856 | |
| variety | 846 | 7.2% |
| forma | 39 | 0.3% |
| var | 18 | 0.2% |
| agg | 1 | < 0.1% |
| fo | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 32568 | |
| e | 22558 | |
| i | 11702 | 10.2% |
| b | 10856 | 9.5% |
| p | 10856 | 9.5% |
| c | 10856 | 9.5% |
| u | 10856 | 9.5% |
| a | 904 | 0.8% |
| r | 903 | 0.8% |
| t | 846 | 0.7% |
| Other values (8) | 1851 | 1.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 114611 | |
| Uppercase Letter | 125 | 0.1% |
| Other Punctuation | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 32568 | |
| e | 22558 | |
| i | 11702 | 10.2% |
| b | 10856 | 9.5% |
| p | 10856 | 9.5% |
| c | 10856 | 9.5% |
| u | 10856 | 9.5% |
| a | 904 | 0.8% |
| r | 903 | 0.8% |
| t | 846 | 0.7% |
| Other values (6) | 1706 | 1.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 125 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 114736 | |
| Common | 20 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 32568 | |
| e | 22558 | |
| i | 11702 | 10.2% |
| b | 10856 | 9.5% |
| p | 10856 | 9.5% |
| c | 10856 | 9.5% |
| u | 10856 | 9.5% |
| a | 904 | 0.8% |
| r | 903 | 0.8% |
| t | 846 | 0.7% |
| Other values (7) | 1831 | 1.6% |
Common
| Value | Count | Frequency (%) |
| . | 20 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 114756 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 32568 | |
| e | 22558 | |
| i | 11702 | 10.2% |
| b | 10856 | 9.5% |
| p | 10856 | 9.5% |
| c | 10856 | 9.5% |
| u | 10856 | 9.5% |
| a | 904 | 0.8% |
| r | 903 | 0.8% |
| t | 846 | 0.7% |
| Other values (8) | 1851 | 1.6% |
Missing 
| Distinct | 8732 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 174042 |
| Missing (%) | 51.4% |
| Memory size | 2.6 MiB |
Length
| Max length | 60 |
|---|---|
| Median length | 52 |
| Mean length | 9.057299967 |
| Min length | 2 |
Unique
| Unique | 1928 ? |
|---|---|
| Unique (%) | 1.2% |
Sample
| 1st row | (Riley) |
|---|---|
| 2nd row | (Roding) |
| 3rd row | Krylova & Moskalev |
| 4th row | Kearfott |
| 5th row | (Leconte) |
| Value | Count | Frequency (%) |
| 18497 | 7.9% | |
| linnaeus | 4238 | 1.8% |
| l | 3882 | 1.7% |
| walker | 3705 | 1.6% |
| barnes | 3618 | 1.5% |
| mcdunnough | 3336 | 1.4% |
| hobbs | 3050 | 1.3% |
| dyar | 2658 | 1.1% |
| busck | 2449 | 1.0% |
| grote | 2439 | 1.0% |
| Other values (4970) | 186253 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 122229 | 8.2% |
| a | 106115 | 7.1% |
| r | 99205 | 6.7% |
| n | 88200 | 5.9% |
| 69727 | 4.7% | |
| o | 67794 | 4.6% |
| l | 64822 | 4.4% |
| i | 63701 | 4.3% |
| s | 62738 | 4.2% |
| ( | 55139 | 3.7% |
| Other values (77) | 689332 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1027746 | |
| Uppercase Letter | 220001 | 14.8% |
| Space Separator | 69727 | 4.7% |
| Other Punctuation | 59119 | 4.0% |
| Open Punctuation | 55139 | 3.7% |
| Close Punctuation | 55139 | 3.7% |
| Dash Punctuation | 2131 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 122229 | |
| a | 106115 | |
| r | 99205 | 9.7% |
| n | 88200 | 8.6% |
| o | 67794 | 6.6% |
| l | 64822 | 6.3% |
| i | 63701 | 6.2% |
| s | 62738 | 6.1% |
| u | 50039 | 4.9% |
| t | 47035 | 4.6% |
| Other values (38) | 255868 |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 23903 | |
| H | 21389 | 9.7% |
| M | 19083 | 8.7% |
| S | 18205 | 8.3% |
| L | 17414 | 7.9% |
| D | 15350 | 7.0% |
| C | 14994 | 6.8% |
| G | 13194 | 6.0% |
| W | 12869 | 5.8% |
| R | 9509 | 4.3% |
| Other values (21) | 54091 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 40323 | |
| & | 18497 | |
| ' | 237 | 0.4% |
| , | 62 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 69727 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 55139 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 55139 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2131 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1247747 | |
| Common | 241255 | 16.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 122229 | 9.8% |
| a | 106115 | 8.5% |
| r | 99205 | 8.0% |
| n | 88200 | 7.1% |
| o | 67794 | 5.4% |
| l | 64822 | 5.2% |
| i | 63701 | 5.1% |
| s | 62738 | 5.0% |
| u | 50039 | 4.0% |
| t | 47035 | 3.8% |
| Other values (69) | 475869 |
Common
| Value | Count | Frequency (%) |
| 69727 | ||
| ( | 55139 | |
| ) | 55139 | |
| . | 40323 | |
| & | 18497 | 7.7% |
| - | 2131 | 0.9% |
| ' | 237 | 0.1% |
| , | 62 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1485509 | |
| None | 3493 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 122229 | 8.2% |
| a | 106115 | 7.1% |
| r | 99205 | 6.7% |
| n | 88200 | 5.9% |
| 69727 | 4.7% | |
| o | 67794 | 4.6% |
| l | 64822 | 4.4% |
| i | 63701 | 4.3% |
| s | 62738 | 4.2% |
| ( | 55139 | 3.7% |
| Other values (50) | 685839 |
None
| Value | Count | Frequency (%) |
| ü | 1265 | |
| é | 848 | |
| è | 433 | 12.4% |
| ö | 240 | 6.9% |
| ø | 171 | 4.9% |
| ä | 134 | 3.8% |
| á | 82 | 2.3% |
| ê | 63 | 1.8% |
| É | 59 | 1.7% |
| å | 38 | 1.1% |
| Other values (17) | 160 | 4.6% |